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Author's Abstract 

The standard explanations of the theory underlying Bezier and B-spline curves 
and surfaces aren't as simple as they should be. They have beautiful figures, and 
those figures clearly illustrate the geometry of the associated algorithms. But the 
labels on the points in the figures are a disaster! In particular, there is no easy way 
to tell, from the labels, what geometric relationships hold among the labeled points. 
This paper proposes a new labeling scheme, based in large part on the work of Paul 
de Faget de Casteljau. 

The key idea behind this new labeling scheme is a classical mathematical prin- 
ciple: a polynomial in a single variable of degree n is essentially equivalent to a 
symmetric polynomial in n variables that is linear in each variable separately. For 
example, a quadratic form on a vector space is essentially equivalent to a symmet- 
ric bilinear form on that vector space. We christen this principle the Blossoming 
Principle, and we refer to the bilinear form as the blossom of the quadratic form. 
The advantage of blossoming is that bilinear maps are simpler than quadratic maps. 
When even bilinear maps aren't simple enough, we can use the tensor-product con- 
struction to convert the bilinear blossom into a linear blossom, defined on a space 
of tensors. 

The Bezier curves and surfaces that are used in computer-aided geometric de- 
sign are polynomial functions from a parameter space to an object space. By apply- 
ing the Blossoming Principle to them, we can transform each such function into its 
blossom, which is a symmetric, multiaffine function. The blossom of a polynomial 
curve or surface provides lucid labels both for its Bezier points and for all of the 
intermediate points that arise during the de Casteljau Algorithm. 

Furthermore, as de Casteljau discovered, the blossoming technology extends to 
handle spline curves with parametric continuity quite neatly. The blossom of such 
a spline curve provides lucid labels both for its de Boor points and for all of the 
intermediate points that arise during the de Boor Algorithm. Spline curves with 
geometric continuity and spline surfaces with triangular patches present unsolved 
labeling challenges, however. 



Capsule Review 

A Bezier curve is a parametric polynomial curve. For example, the parametric 
equations x = 4t z + 3t 2 - 6*+ 5 and y = t z - 4t 2 + 2t - 1 define a cubic Bezier curve in 
the x-y plane. A spline curve is a curve consisting of a sequence of segments of Bezier 
curves, where consecutive segments are constrained to meet at joints with some 
degree of continuity. Spline curves and the analogous spline surfaces are commonly 
used in computer aided design to model curved shapes ranging from typographic 
fonts to automobile bodies. 

This paper presents a new perspective on Bezier and spline curves, based on 
a one-to-one correspondence between univariate polynomials of degree k and sym- 
metric A;-variate polynomials of degree 1 in each variable. (For example, the cubic 
polynomial F(t) = At 3 + Zt 2 - 6t + 5, which defines the x coordinate as a function of 
the parameter t in the curve mentioned earlier, is given by F(t) = /(£,*,£), where 
/(u, v, w) = 4uvw + uv + uw + vw - 2u - 2v - 2w + 5.) Using this perspective, the 
author derives the standard constructions for Beziers and splines in a manner that 
gives perspicuous labels to the points arising at all the steps of these constructions, 
and discusses the conditions that govern the degree of continuity achieved at the 
joints of splines. He also explores a variety of related topics, including characteriza- 
tion of lower-degree Bezier curves as degenerate higher-degree curves, approximation 
of higher-degree Bezier curves by lower-degree curves, and extensions of the theory 
from curves to surfaces, from parametric continuity to geometric continuity, and 
from parametric polynomial functions to parametric rational functions. A number 
of tantalizing open questions remain concerning these extensions. 

The text is intended to be self-contained, requiring no previous study of the 
theory of splines, and it succeeds in this goal. The key ideas are carefully introduced, 
examined from multiple points of view, and illustrated with numerous figures and 
examples. The text does, however, demand of the reader a certain ability to deal 
comfortably with mathematical abstactions — the quality that my college math pro- 
fessors referred to as "mathematical maturity." Less mathematically inclined read- 
ers may prefer to study the author's shorter paper [1] on splines, which presents a 
subset of the results given here and is less demanding mathematically. The reader 
who puts forth the necessary effort in studying the present text will be not only 
rewarded with considerable insight into the theory of Beziers and splines, but also 
treated to a variety of interesting mathematical sidelights, such as an unusual proof 
of the theorem that the medians of a triangle trisect each other and a discussion of 
Zeeman's "umbilic bracelet" (characterizing the set of cubics with three real roots). 

Jim Saxe 

[1] Lyle Ramshaw. "Beziers and B-splines as multiaffine maps." To appear in 
Theoretical Foundations of Computer Graphics and CAD, the proceedings of 
a NATO International Advanced Study Institute in Lucca, Italy during July, 
1987. Springer- Verlag (1987). 
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Part A: Introduction 



This is a mathematical paper that presents, in seven parts, a novel approach to 
the study of spline curves and surfaces. One important benefit of the new approach 
is a distinctive scheme for labeling the points in diagrams of splines. The primary 
intent of this introductory Part A is to give the reader a taste of the glories of that 
labeling scheme. 

1. Some intriguing labels 

One of the basic problems in computer-aided geometric design is the mathe- 
matical modeling of smooth shapes. A popular approach to this problem involves 
the use of piecewise parametric polynomial functions, which are called splines. Each 
of the three adjectives "piecewise," "parametric," and "polynomial" reflects a choice 
of strategy. 

The choice of a piecewise approach means that different regions of the smooth 
curve or surface being modeled are specified by different systems of equations. This 
is convenient because it allows for local flexibility: one region of the shape can be 
adjusted without affecting distant regions. In order to guarantee the smoothness 
of a piecewise model, appropriate constraints must be placed on adjacent pairs of 
regions so as to achieve a smooth joint between one region and the next. 

Parametric methods are modeling methods that describe a shape in terms of 
auxiliary parameters. For example, the formula F(t) := (cos(i),sin(*)) is a para- 
metric description of the unit circle in the plane, with t as the parameter,* The 
other popular class of modeling methods is implicit methods. The unit circle is 
described implicitly by the equation G(x,y) = 1, where G(x,y) := x 2 + y 2 . The 
choice between parametric and implicit methods arises because the primordial ob- 
jects of mathematics are functions rather than shapes. Suppose that we want to 
model a p-dimensional shape S sitting in a ^-dimensional surrounding space Q. In 
parametric methods, we invent an auxiliary parameter space P of dimension p and 
we model the shape S as the range S ~ F(P) of a function F:P ->Q. In implicit 
methods, on the other hand, we invent an auxiliary space R of dimension r := q-p 
and we model the shape S as the inverse image S := G _1 (z), under some function 
G:Q -4 i?, of some chosen point z in R. Both schemes involve specifying the shape 
S as more than just a set. In parametric methods, each point x on the shape S has 
an associated parameter value J F~ 1 (x), while, in implicit methods, each point y not 
on the shape S has an associated value <3(y). Parametric and implicit methods are 
both useful and important, but this paper is concerned exclusively with parametric 
methods. 

A function F: P — > Q is called a polynomial function of degree n if each coor- 
dinate of the result point F(u) in Q can be expressed as a polynomial of degree 
n in the coordinates of the argument point u in P. (Throughout this paper, we 
shall interpret the phrase "of degree n" to mean "of degree n or less," a concept 

* We shall use the relation symbol a :=" in equations that are being used to define 
the symbol on the left-hand side. By contrast, equations written with the standard 
equality symbol "=" assert a relationship between quantities defined elsewhere. 
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that is often described by the phrase "of order n + 1") Since polynomials of low 
degree are easy to evaluate and to differentiate, the class of polyomial functions of 
degree n is one obvious choice to use when building models of smooth shapes, and 
it is the one on which we shall focus in this paper. Rational functions of degree 
n are another reasonable choice, which we shall discuss briefly in Part G. Various 
classes of transcendental functions could also be used; for example, the function 
F(t) := (cos(*),sin(i)) mentioned above is a transcendental parametric model of 
the unit circle. By the way, the unit circle also has quadratic rational parametric 
models, such as F(t) := <(l - t 2 )/(l + 1 2 ), 2i/(l + 1 2 )>. But, as we shall see shortly, 
the unit circle has no polynomial parametric model of any finite degree. 

To reiterate, our overall plan is to model smooth shapes as the ranges of piece- 
wise polynomial functions. For the rest of Section 1, we shall restrict ourselves, 
for simplicity, to the case of smooth curves, and we shall choose the real num- 
bers P := R to be the parameter space. That is, we shall build spline curves by 
assembling segments F ([«,«]) of n " ic polynomial curves F:H — ► Q. We begin by 
considering a single curve segment in isolation. 

What if n, the degree bound, is one? If F:R -> Q is a polynomial curve 
of degree one, a segment F([s y t]) is a line segment. The obvious way to specify 
such a segment is by giving its endpoints F(s) and F{t) as points in the 

object space Q. Every choice of two endpoints determines a corresponding function 
F uniquely, and in a geometrically intuitive and numerically stable manner. (If it 
happens that F(s) = F(t), the associated function F will be of degree zero, that 
is, constant, as well as being of degree one.) If we know the endpoints F(s) and 
F(t), we can compute any other point F(u) for u in by performing a linear 
interpolation. 

The parametric polynomial curves F: R — > Q of degree two are more interesting. 
First, since the acceleration vector F"(u) is constant, we observe that the curve F 
must lie in a plane in Q. Everyone knows that the plane curves that can be described 
implicitly by a polynomial of degree two are precisely the conic sections. Indeed, 
any conic section lying in any plane in the object space Q can be described implicitly 
by using q — 2 linear polynomials to determine the plane and then one quadratic 
polynomial to determine the conic. But we are interested in parametric modeling 
methods, and it turns out that only parabolas can be parameterized with quadratic 
polynomials. If each coordinate of the point F(u) is given by a polynomial in u, 
then the curve F must go off to infinity in precisely one direction (or its opposite) 
as u — ► ±oo. Thus, ellipses are out because they don't go to infinity at all, while 
hyperbolas are out because they go to infinity in more than one direction. 

What is a good way to specify an arc F{[s y t]) of a parabola F: R -» Q? Fig. 1.1 
shows one technique, in which the segment F([0, 1]) is determined by specifying three 
points in Q, called the Bizitr points of the parabolic arc F([0, 1]): the starting point 
jF(0), the point where the starting and ending tangents intersect, and the ending 
point F(l). In Fig. 1.1, these three Bezier points are labeled /(0,0), /(0,1), and 
/(1,1). If we want to compute the point F(t) for t in [0,1] from the three Bezier 
points, we can do so by performing three linear interpolations as shown. First, 
we interpolate between /(0,0) and /(0,1) to find f(0,t). Second, we interpolate 
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Fig. 1.1. The de Casteljau Algorithm for a quadratic Bezier curve 




Fig. 1.2. The de Casteljau Algorithm for a cubic Bezier curve 



between /(0,1) and /(1,1) to find /(*,1). Finally, we interpolate between /(0,i) 
and /(f , 1) to find F(t) = /(*, t). This algorithm for computing F(t) from the Bezier 
points of the segment F([0, 1]) is called the de Casteljau Algorithm. Bezier points 
and the de Casteljau Algorithm are standard material; Bohm, Farin, and Kahmann 
have written a fine survey of this area [5], The function /:R 2 — ► Q appearing in the 
labels in Fig 1.1 is not standard, however; it is our first example of a blossom. In the 
quadratic case, we can think of the blossom value /(u,v) as the point of intersection 
of the tangent lines to the parabola at F(u) and F(v). 

Fig. 1.2 shows a segment F([0> 1]) of a polynomial cubic curve. The four points 
labeled /(0,0,0), /(0,0,1), /(0,1,1), and /(1,1,1) are the four Bezier points of this 
cubic segment. (The cubic F shown in Fig. 1.2 is degenerate in the sense that it lies 
in a plane; a nondegenerate polynomial cubic is a twisted space curve, and the four 
Bezier points of a segment of such a cubic would not be coplanar.) The remaining 
points and lines are the result of using the de Casteljau Algorithm to compute the 
point F(t) — f(t y t,t) by doing six linear interpolations, starting with the Bezier 
points. The function f(u,v y w) appearing in the labels is the blossom of F. This 
blossom /:R 3 — ► Q is a symmetric function of three real arguments; that is, the 
value /(u,v,«;) doesn't depend on the order of u, v, and w. Furthermore, the 
blossom / is related to the curve F by the identity F(u) = /(u,u,u). Note that the 
incidence structure of the points and lines in Fig. 1.2 is reflected in the labels: two 
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Fig. 1.3. The Bohm-Sablonniere Rules for a cubic spline curve 



points lie on a common line in Fig. 1.2 if and only if two out of the three arguments 
in their labels are the same. The distance ratios are also implied by the labels: for 
example, the point /(t,t, 1) lies t of the way from /((),*, 1) to /(l,t,l). (Symmetry 
implies that /(l,t,l) = /(*iM)0 More generally, whenever we vary one of the 
arguments of / while holding the other two fixed, the value of / varies along a 
straight line at a constant rate — in fancier language, the function / is triaffine. 

In fact, the labels in Fig. 1.2 have so many nice properties that they capture 
all of the structural information in the diagram. Suppose that we were told that 
/ was a symmetric, triaffine function and that the four outer vertices in Fig. 1.2 
were the points /(0,0,0), /(0,0,l), /(0,1,1), and /(1,1,1). And suppose that 
we wished to compute /(t, t). We would be able to reconstruct Fig. 1.2 from 
this information, even if we had never heard of Bezier points or the de Casteljau 
Algorithm. We would linearly interpolate between the four outer points, taken in 
pairs, to compute the three points that have one t in their labels. A second stage 
of interpolations, taking these three points in pairs, would give us the two points 
/(0,t,*) and /(t,f, !)• A thin* stage would use a single interpolation would give us 
the desired point /(*,*>*) = ^(*)- 

Labels this good are worthy of study. Our first major goal in this paper is to 
construct the blossom / of an n-ic polynomial curve or surface and to explore the 
connection between this blossom and the Bezier-de Casteljau theory of polynomial 
curves and surfaces. The other major goal is to investigate the extent to which the 
same ideas can be extended to parametric curves or surfaces with more than one 
polynomial piece, that is, to splines. 

Fig. 1.3 shows a spline curve G defined on the interval [0,6] and composed of 
five pieces. The curve G(u) follows one polynomial cubic for u in [0, 1], then switches 
over to a different polynomial cubic for u in [1,2], and so on for [2,3], [3,4], and 
[4, 6]. (The fifth parameter interval is twice as long as the rest; in general, parameter 
intervals can have arbitrary lengths.) The joints between each adjacent pair of cubics 
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Fig. 1.4. The de Boor Algorithm for a cubic spline curve 



have C 2 continuity; that is, no jump is allowed in position, velocity, or acceleration, 
although a jump in jerk is permitted. (Jerk is a name from the automobile industry 
for the third derivative of position with respect to time.) But the behavior at the 
endpoints u = 0 and u = 6 is entirely unconstrained. This state of affairs is described 
in the trade by saying that the knot sequence of G is (0,0,0,0,1,2,3,4,6,6,6,6). As 
before, the labeling function g(u,v 7 w) is a symmetric function of three arguments, 
related to the spline G by the identity G(u) = y(u,u,u). Furthermore, the function 
g behaves triaffinely. For example, as u varies from 2 to 6, the point y(u,3,4) 
moves at a constant rate along the line segment joining y(2,3,4) to y(6,3,4). The 
outermost points in this diagram have labels of the form g(r,8,t) where (r,s,i) is 
a triple of consecutive knots. They are precisely the de Boor points by which the 
spline G is controlled in the standard theory [5]. Furthermore, for each pair (s,t) of 
consecutive knots, the Bezier points of the cubic polynomial segment G([s 9 1\) are the 
points g{s,s y s) } g(s } s,t), g{s,t y t) 9 and g{t,t y t). Thus, the multiaffine labels help to 
clarify the relationship between the de Boor points of a spline curve and the Bezier 
points of its segments, a relationship first explored by Bohm [3] and Sablonniere [37]. 
(The initial and final knots 0 and 6 are repeated four times, instead of only three, 
because even the position of G(u) is allowed to jump discontinuously as u moves 
through 0 or 6. This four-fold repetition means that there are two different de Boor 
points labeled ^(0,0,0); the first of them gives lim u | 0 G r (u), while the second gives 
lim u jo <2(u). Only the second is shown in Fig. 1.3.) 

A word of warning: since G(u) is a cubic spline curve rather than a single 
polynomial cubic, the function g in Fig. 1.3 can't be as simple as the / in Fig. 1.2. 
The extra complexity shows up in the fact that g(u y v,w) is well-defined only when 
its three arguments are fairly close together — in particular, when they don't skip 
over any knots. For example, the value $(u,3,4) is well-defined only for u between 
2 and 6. 

Fig. 1.4 shows the same spline curve G as in Fig. 1.3, but with the auxiliary 



6 



INTRODUCTION 



1.4 



points and lines that we would draw to compute G(2.6) = 3(2.6,2.6,2.6). Starting 
from the de Boor points, we construct new points that have more and more copies of 
2.6 among their j-arguments by performing linear interpolations between previously 
constructed points. This algorithm for evaluating a spline curve is called the de 
Boor Algorithm [5].* With these labels on the points, the de Boor Algorithm seems 
like a fairly obvious extension of the de Casteljau Algorithm. Indeed, one of the 
best reasons to use labels based on blossoms is the clarification that they bring to 
parametrically continuous spline curves. 

Where did blossoming come from, and who deserves the credit? We shall close 
this introduction by considering those historical questions briefly. 

As applied to a single polynomial function, blossoming is a standard mathe- 
matical tool in every respect except for the name "blossoming," which I coined for 
this paper. For example, mathematicians have known for some time that there is 
a unique symmetric, bilinear form associated with any quadratic form on a vector 
space [28, 32]. That theorem is precisely the homogeneous, degree two case of blos- 
soming. Furthermore, there is nothing tricky about transferring that result to the 
nonhomogeneous, degree n case, which is the case that is needed to transform a 
polynomial curve such as the F of Figs. 1.1 or 1.2 into its multiaffine blossom /. 

Paul de Faget de Casteljau was the first to apply blossoming to the polynomial 
curves and surfaces used in computer-aided geometric design. He also gets the credit 
for discovering that the blossoming technology extends very neatly to parametrically 
continuous spline curves, as described in his 1984 notes entitled "Formes a Poles," 
that is, shapes specified by poles [14]. A pole, in de Casteljau's terminology, is a point 
such as /(0, 0, 1) or y(2, 3, 4), that is, a value of a blossom. Someone familiar with de 
Casteljau's work wouldn't have found anything new in this introduction, except for 
the terminology and notation that I developed while independently stumbling upon 
the same ideas. But many of the techniques and results in the subsequent sections, 
such as the use of the symmetric variant of the tensor-product construction, appear 
for the first time in these pages (to the best of my knowledge). 

Carl de Boor and Klaus Hollig have been approaching blossoming in their recent 
work on B-splines. In the traditional theory, B-splines are defined by means of a 
formula involving divided differences. From this formula, one proves a recurrence 
relation. And from the recurrence relation, one derives the de Boor Algorithm. In 
their paper "B-Splines Without Divided Differences," de Boor and Hollig showed 
that one can take the recurrence relation as basic [13]. In this paper, we shall explore 
the remaining possibility, taking the de Boor Algorithm as basic and deriving the 
recurrence relation and the divided-difference formula from it. One consequence 
of this change of perspective is that primal and dual are reversed. In the work 
of de Boor and Hollig, B-splines are primal objects, and their duals are called the 
dual functional*, written A» tn +i. In this paper, the same objects that de Boor 
and Hollig refer to as dual functional appear as a primal objects, in particular, 

* Comparing our Fig. 1.4 with the corresponding Fig. 18 in the survey paper [5], 
it is amazing how well their figure is able to communicate the de Boor Algorithm 
under the severe handicap that its points are just labeled "dj". 
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as the n-contravariant tensors A» in +i = • • • t^+ n . B-splines are the duals of 
these n-contravariant tensors, that is, they are n-covariant tensors, as described in 
Remark 18.13. 

In a fuzzier sense, the large body of work on algorithms for knot insertion wets 
also moving in the general direction of blossoming. A spline curve, like the G(u) 
in Fig. 1.4, is a function; the de Boor Algorithm provides a way to evaluate that 
function, that is, to evaluate blossom values of the form g(u, u, u), whose arguments 
are all equal. Knot-insertion algorithms took a step closer to blossoming by widening 
the scope of discourse to include blossom values g(u, v, w) whose arguments are not 
equal. In particular, one way to compute the blossom value g(u, v,w) is to use a 
knot-insertion algorithm to insert u, i>, and w as knots (assuming that u, v, and 
w are close enough together to make ^(u,v,ty) well-defined). On the other hand, 
knot-insertion algorithms also took a step away from blossoming by working with 
sequences of knots and sequences of control points, hence foolishly and needlessly 
giving up on the concept of the spline as a function. Blossoming is what results 
when we take the spline G(u) — j(u,u,u) and allow ourselves to evaluate it, as a 
function, on n-tuples of parameter values (u,v,u;) whose components are not all 
equal. 



Part B: The Three Principles 

The theory behind the intriguing labels of Section 1 depends upon three classical 
mathematical principles, which we shall review and customize for our purposes in 
this part. The Blossoming Principle is the first of these, fairly easy to understand 
and yet very powerful. The Tensoring Principle comes next; it is technically more 
sophisticated, and its payoff is not as dramatic. But, once we have invested the 
necessary effort, it will reward us with valuable algebraic insights into polynomial 
curves and surfaces. Third is the Homogenizing Principle, which is both simple and 
familiar; it comes into play when differentiation is involved. While investigating 
these three principles, we shall also study the relationships between them and the 
standard techniques that are used for controlling polynomial curves and surfaces in 
computer-aided geometric design. 

2. The Blossoming Principle 

The Blossoming Principle is the core idea behind these new labelings. It allows 
us to replace a function of high degree in one variable by an equivalent function of 
degree one but with lots of variables. 

Consider the cubic polynomial F(t) = 7i 3 + 6t 2 - 3t + 5. Suppose that we want 
to construct a trivariate polynomial /(u,v,u;) of the form 

/(u, v, w) = r x uvw + r 2 uv + r 3 uw + r 4 vw + r 5 u + r 6 v + r 7 w + r 8 

that satisfies the identity f(t, t 9 t) = F(t)\ in other words, we want the main diagonal 
of /(u, v, w) to agree with F. To achieve this correspondence, we must have r x = 7, 
r 2 + r s + r 4 = 6, r 5 + r 6 + r 7 = -3, and r 8 = 5. We can determine / uniquely if we 
also demand that /(u,v,tt;) be a symmetric function of its three arguments. This 
symmetry condition forces us to make r 3 = r$ = r 4 and r 5 = r 6 = r 7 , resulting in 
the unique choice 

/(u, v, w) — 7uvw + 2uv + 2uw + 2vw - u- v- ty + 5. 

This argument can be generalized quite a bit; but first, a question of nomen- 
clature. What shall we call a polynomial, such as the / above, that has degree one 
in each of its variables separately? The name "multilinear" pops to mind, but there 
is the problem that the word "linear" often implies "homogeneous" as well as "of 
degree one." (A polynomial is homogeneous of degree n when all of its terms have 
total degree precisely n; in particular, / would have to have r 2 = r 3 = • • • = r 8 = 0 
in order to be multilinear in the homogeneous sense.) To avoid confusion, we shall 
use the word "affine" instead of "linear" when we mean "of degree one, but not 
necessarily homogeneous." In particular, we shall say that / is multiaffine. 

In fact, there are two parallel worlds in which mathematics of this flavor can 
be done: the linear world and the affine world. The linear world consists of linear 
spaces (also called vector spaces) and homogeneous linear maps between them. The 
natural notion of a polynomial map in this world demands that the polynomials in- 
volved also be homogeneous. The less familiar affine world consists of affine spaces 
and affine maps between them. An affine space is like a linear space except that no 
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point has been distinguished as the origin. An affine map is of degree one, but it 
is not necessarily homogeneous; indeed, since affine spaces don't have distinguished 
origins, it doesn't really make sense to ask whether an affine map is homogeneous or 
not. Similarly, in the affine world, the natural notion of polynomial maps does not 
demand that they be homogeneous. In this paper, we shall focus on the affine world 
more than most mathematicians typically do, because it is the home of the para- 
metric polynomial curves and surfaces that are used in computer-aided geometric 
design. If F: P Q is such a curve or surface, note that neither the parameter space 
P nor the object space Q has a distinguished origin, and note that the polynomials 
involved in defining F are generally not homogeneous. 

With this nomenclature, we can describe the phenomenon exemplified by F 
and / quite concisely: if F(t) is any cubic polynomial, there exists a unique sym- 
metric triaffine polynomial /(u,v,u;) that agrees with F on the diagonal, that is, 
that satisfies the identity f (t y t,t) = F(t). Going backwards from / to F is even 
easier: if f(u,v y w) is triaffine, the formula F(t) := f{t,t } t) always defines a cubic 
polynomial F, whether or not / is symmetric, and without any need to manipulate 
coefficients. Thus, univariate cubic polynomials and symmetric triaffine polynomials 
are equivalent concepts; they are just different forms of the same underlying math- 
ematical object. That doesn't mean, however, that they are equally easy to work 
with in specific situations. We shall find that, in many cases, the symmetric triaffine 
version reveals the underlying structure more clearly than does the cubic version of 
the same abstract entity. Based on the intuition that the structure bound up in F 
is unrolled and revealed in /, we shall refer to the process of deriving / from F as 
blossoming, and we shall refer to / as the multiaffine blossom of F. (You may or 
may not like the name "blossoming," but I certainly feel that some name should be 
chosen. Mathematicians seem mysteriously content to refer to / as "the symmetric 
triaffine function that corresponds to F f n where the sense of the correspondence 
must be inferred from the context. I prefer the phrase "the blossom of F") 

Any function defined by polynomials can be blossomed. For computer graphics 
applications, we shall extend our blossoming expertise in two ways. First, we shall 
generalize from degree three to degree n. Second, we shall deal with functions from 
one multidimensional affine space to another, rather than with functions from the 
reals to the reals. We begin by stating some definitions. 

Let P be a parameter space and Q be an object space, both affine spaces over 
the real numbers R. Addition and scalar multiplication aren't defined as separate 
operations in an affine space. Instead, there is a single operation of "taking an affine 
combination" : given two points u and v in an affine space and any real number r, 
there is a well-defined point (l-r)u-f rv, which we can think of as "r of the way from 
u to v." A function F:P Q is called affine if it preserves affine combinations, 
that is, if the identity f((l-r)u + rv) = (l-r)F(u) + rF(v) holds. We shall call a 
function /: P n — ► Q of n arguments multiaffine or n- affine if it is an affine function 
of each argument when the others are held fixed. 

The concept of affineness makes perfect sense even for infinite-dimensional 
spaces P and Q y but we won't need that generality in this paper. In the finite- 
dimensional case, affine maps are precisely those that can be represented by poly- 
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nomials of degree one. (Following our convention, the phrase "of degree one" here 
means "of degree one or less.") For example, if we choose Cartesian coordinate sys- 
tems for the finite-dimensional affine spaces P and Q> a map F: P Q is affine if and 
only if each coordinate of the result point F(u) can be written as a polynomial of de- 
gree one in the coordinates of the argument point u. For a multiaffine function /, the 
total degree of the polynomial that represents a coordinate of the point /(ui, . . . , u n ) 
might be as high as n; but each term of that polynomial will include at most one 
of the coordinates of any particular argument u*. For example, if f :P 2 — ► Q is 
a biaffine function whose arguments lie in the plane P of points u = {ti,v), then 
the polynomials that give the coordinates of /(ui,u 2 ) = f{{ui y v 1 ) ) {u 2 ,V2)) could 
include terms of the form auiu 2 or buiv 2 . But a term of the form cu x vi would 
imply non-affine dependence of /(ui,U2) on Ui. 

If P and Q are finite-dimensional afiine spaces, a function F:P — ► Q is called 
polynomial of degree n if, when we choose Cartesian coordinate systems for P and 
Q, each coordinate of the point F(u) can be written as a polynomial of degree n 
in the coordinates of the argument point u. This condition is independent of the 
choice of the coordinate systems in P and Q. The special names polynomial curve 
and polynomial surface are used for polynomial functions F: P — ► Q whose domains 
P have dimensions one and two, respectively. 

Proposition 2.1: The Blossoming Principle (affine variant). UP and Q are 
finite-dimensional affine spaces, then polynomial functions F:P Q of degree n are 
equivalent to symmetric, n-affine maps f :P n Q. In particular, given a function 
of either type, a unique function of the other type exists that satisfies the identity 
F(u) = /(u,u,...,u). We shall call f the multiaffine blossom of F and F the 
diagonal of/. 

Proof Since there is no interaction between the different coordinates of the points in 
the object space Q, it suffices to consider each coordinate separately, or equivalently, 
to consider the case in which Q is the real numbers R. 

Let p be the dimensionality of the parameter space P. Choose a Cartesian 
coordinate system for P, and let u = (u 1 , . . . ,u p ) denote the coordinates of the 
point u in P in this coordinate system. We are using superscripts to enumerate the 
p dimensions of P in order to leave subscripts free to enumerate the n arguments of 
the blossom /. 

By our definitions, the map F: P — ► R is a polynomial function of degree n if 
and only if the quantity F(n) is given by a polynomial of total degree no greater 
than n in the variables u 1 through u p . A term of this polynomial has the form 

T F :=r{u l ) k *{u 2 ) k *---{u*) k > 

for some real coefficient r and for some nonnegative integer exponents k 3 - satisfying 
k i ^ n - Similarly, the function /: P n -> R is n-affine if and only if we can 
express the value /(vi,v 2 , . . . ,v n ) as a polynomial in the np variables v\ whose 
general term has the form 
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where each e tJ is either 0 or 1 and we have £ ■ e»y < 1 for each fixed % . 

Suppose that we are given F and we want to construct / satisfying /(u,...,u) = 
^(u). Note that the term T f will contribute to T F when we substitute u for each 
of the v t if and only if € h' = k j for eac h J- If we let / denote the difference 

/ := n - J2j kj y the number of terms Tf that contribute to the particular term 2> 
is given by the multinomial coefficient ( fc n k If we want / to be symmetric, 
our only choice is to split up each term T F into ( fc n fc z ) symmetric terms Tf in / 
with equal coefficients. 

Going the other way, suppose that we are given /, whose terms Tf satisfy 
YsjCij < 1 by assumption. When we substitute u for each of the v», the total 
degree of the result will be £V £\ e t -y < n. Hence, the identity F(u) := /(u, . . . ,u) 
will define a unique polynomial function of degree n or less. □ 

Corollary 2.2. IfPandQ are finite-dimensional affine spaces, a symmetric n- affine 
map f :P n ~+Q is completely determined by its values on the main diagonal of P n , 
that is, by the values of the form /(u,u, . . . ,u). n 

The Blossoming Principle has a linear-world variant also: starting with a poly- 
nomial function F that is homogeneous gives a blossom / that is multilinear instead 
of just multiaffine. The only difference in the proof is that we always have / = 0 in 
the homogeneous case. The particular case n = 2 and Q = R of the linear variant 
is quite a famous theorem: it states that quadratic forms on a vector space are 
equivalent to symmetric bilinear forms [28, 32]. 

Our proof of the Blossoming Principle worked with explicit coordinate systems 
throughout. There are proofs that depend less on explicit coordinates, but they 
tend to obscure the essential simplicity of the Blossoming Principle. For example, 
consider a quadratic polynomial function F{u) in the affine world. The associated 
biaffine blossom /(u!,u 2 ) can be defined without any reference to coordinates by 
means of the formula 

4F(^)- J P(u 1 )-F(u 2 ) 

/K,u 3 ) = — i '—^ , 

which is called the polarization identity [19, 29, 33]. For general degree n, the 
analog of this formula, which we shall prove in Section 4, has an inclusion-exclusion 
structure: 

/(ui,u 2 ,...,u n ) = l £ (-) n -^(»E u 0- ( 2 ' 3 > 

' SC{1 ( 2 n} \ i€S J 

*=|S| 



Two comments about this formula before we proceed. First, the corresponding 
formula in the linear world can be made a little simpler, because the factor of 
k n outside F cancels against the 1/k in the argument when F is known to be 
homogeneous of degree n. Second, it may seem a bit dicey to allow S — 0 in the 
summation, since the argument of F in the S = 0 term is the indeterminate form 
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0/0. When n > 0, the indeterminacy in the argument of F doesn't matter, because 
that value of F is multiplied by 0 n = 0. In the special case n = 0, the indeterminacy 
doesn't matter for a different reason: F is constant. Indeed, if we tried to avoid the 
indeterminacy by explicitly demanding S ^ 0 in the summation, we would cause 
the formula to fail for the case n = 0. 

While Eq. (2.3) is coordinate-free, it is rather complicated. Furthermore, a 
proof of the Blossoming Principle based on this formula still has to chose a coor- 
dinate system for P in order to prove that the right-hand side does indeed define 
a multiaffine map. In fact, there is no hope of our avoiding coordinates entirely, 
since our definition of a polynomial function involved coordinates. More abstract 
texts manage to avoid coordinates by turning the Blossoming Principle from a the- 
orem into a definition: they define a "polynomial function" to be the diagonal of a 
multiaffine function. The argument above is then used to show that, in the finite- 
dimensional case, these "polynomial functions" are precisely the functions that are 
given by polynomials when expanded in coordinates. 

Restriction 2.4* From now on, all affine and linear spaces mentioned in this paper 
are implicitly assumed to be finite-dimensional. The exploration of the infinite- 
dimensional situation is left as a challenge for the reader (and for the author). 

The contrast between our proof of Prop. 2.1, which was based on choosing 
coordinates, and the coordinate-free Eq. (2.3) is a phenomenon that will come up 
again in Part D. One way to compute a blossom is to choose coordinate systems for 
all of the spaces involved, and then manipulate coefficients. This technique involves 
making lots of choices and defining lots of symbols. But, if the right coordinate 
systems are chosen, the actual arithmetic involved tends to be quite simple: just 
multiplying or dividing each coefficient by a factorial or two. We shall refer to this 
technique as coordinate- based. The alternative coordinate-free technique searches 
for a single formula that expresses an arbitrary value of the desired blossom as an 
affine combination of values of something known. The formulas that result from the 
coordinate-free technique are conceptually simpler, in the sense that they give the 
whole solution in one equation. But the combinatorics and arithmetic involved in 
them tend to be much more sophisticated. In the instance we have just seen, the 
coordinate-free approach may also seem more expensive, since the sum in Eq. (2.3) 
involves 2 n terms. On the other hand, note that the number of terms is independent 
of p = dim(P); thus, the same coordinate-free formula works for curves, for surfaces, 
or for p > 3. 

3. The blossoms of B6ziers 

Parametric polynomial curves and surfaces are basic objects in computer-aided 
geometric design, and the affine variant of the Blossoming Principle applies to them 
directly. In this section, we shall relate blossoms to the Bezier-de Casteljau theory 
of curves and surfaces. 

Before starting on curves, which are the easiest case, we shall pause to institute 
a new notational convention. A polynomial curve is a polynomial function F:L -+ Q 
whose domain L is a one-dimensional affine space, that is, an affine line. So far, 
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we have been ignoring the distinction between a point of the domain space L and 
a real number. For example, in Section 1, we wrote F(l) where we meant, more 
precisely, "the result of applying F to the point in L whose coordinate, in some 
chosen coordinate system for L, is the real scalar 1 * While it might seem pedantic 
to object to this practice, it turns out that a little pedantry now will pay off when 
we consider tensors in Section 4. In general, if P is an affine space for which we have 
chosen some Cartesian coordinate system, we shall use angle brackets to denote a 
point u in P given in terms of coordinates u = (u 1 , . . . , u p ). For the particular case 
of the affine line L, let us adopt the abbreviation i := (t) for the point in L whose 
coordinate is the scalar t. Under this convention, the expression 2 + 3 denotes the 
scalar 5; the expression (2+3)/2 denotes the point in L midway between the points 2 
and 3, which can also be written and the expression 2+3 is not well-formed, since 
it denotes a non-affine combination of points. To make this convention retroactive, 
we would have to go back and add lots of bars to Section 1. In particular, each 
argument to F, /, or g in the labels of Figs. 1.1 through 1.4 should have a bar over 
it. 

Let F:L — ► Q be a polynomial curve of degree n, and let /: L n — ► Q be its 
blossom; that is, / is the unique symmetric, n-affine map that satisfies F(u) = 
/(u, u, . . . , u), where u denotes an arbitrary point in L. Suppose that we choose a 
reference segment i] C L, and consider the value /(u x , u 2 , . . . , u n ). We can write 
the first argument point Oi of / as an affine combination of the points a and i as 
follows: 



=(^) 5+ fe) f - 



(This formula simplifies somewhat in the particular case of the reference segment 
[0,1]; we get tii = (1 - ui)0 + uj. But it goes against the grain of the affine 
approach to single out any particular reference segment.) Since / is affine in its first 
argument, it preserves affine combinations; therefore, 

/(fli,fl>,...,0»)= (jzj) /(*»«**,.•.,*»)+ (yfr) /(Mi.-- •>«»)• 

The next step is conceptually straightforward, although notationally somewhat 
clumsy: we want to simultaneously expand all n of the argument points tii through 
u n as affine combinations of § and f. The resulting sum has 2 n terms: 

n^e,^' 9j *> Kt - J ^ir^r 

(3.1) 

Note that we have taken advantage of the symmetry of / in order to put all of 
the a'a before all of the i'a in the arguments to /. The interesting thing about 
Eq. (3.1) is that it expresses every value of / in terms of just the n+ 1 special values 
whose arguments consist entirely of S'a and t 's. Thus, a symmetric n-affine function 
/: L n — » Q is completely determined once we know the particular values /(*, . . . , a), 
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/(«, . . . , I, t ), . . . , /(*, . . . , t) as points in Q. Furthermore, there are no constraints 
on these special values: if x 0 through x n are any n + 1 points in Q> the formula 

/c ■»>= s: n(^)nfe)^ 

/UJ={l,2,...,n} 

defines a symmetric n-affine function f:L n —> Q that satisfies 
This proves the following. 

Proposition 3.2. Let [f,t] be a reference segment in the affine Jine L. The formula 

/(*,..., S, -x fc 

n-Jfe k 

for k in [0, n] provides a one-to-one correspondence between symmetric, multiaffine 
functions f:L n ->Q and (n + l)-tupies (x 0 , . . . , x„) of points inQ. □ 

Proposition 3.3: Bezier curve segments. The formula in Prop. 3.2 also pro- 
vides a one-to-one correspondence between polynomial curves F:L — ► Q of degree n 
and (n + l)-tuples (x 0 , . . . ,x n ) of points in Q, where the f in Prop. 3.2 denotes the 
blossom of F. Moreover, the points x* are precisely the Bezier points by which the 
curve segment F([§,i]) is controlled in the standard theory 

Proof. The first claim follows at once from Prop. 3.2 and the Blossoming Principle. 
As for the connection with the Bezier theory, the Bezier n-ic curve F(u) is most 
often defined by using the Bernstein basis polynomials [5] to blend together the 
Bezier points Xo through x n : 

In our theory, the curve F(u) is the diagonal /(ti,...,u) of its blossom. If we 
substitute u for each of the ti t in Eq. (3.1), we find that 

- /(• •>-£(;) (j^T (Hf ) ' f^-h^D- 

0<k<n n-fc k 

Comparing these two formulas for F(u) completes the proof. □ 

It is interesting to note that the convex-hull property of the Bezier scheme for 
curves carries over from a curve F to its blossom /. The standard Bezier theory 
tells us that the point F(u) will be within the convex hull of the Bezier points of 
the segment as long as u is in [«,*]. More generally, the blossom value 

/(tii, * * * ) «n) will also be in that convex hull as long as all of the tx t are in [f , i ]. 
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Fig. 3.4 The de Casteljau Algorithm for a quadratic Bezier surface 

We can begin to develop a geometric intuition for the behavior of blossoms from 
Prop. 3.3. Consider a quadratic polynomial curve in the plane, that is, a parabola F. 
For any a and t> we know from Prop. 3.3 that the three Bezier points of the parabolic 
segment F([a, i]) are the blossom values / (a, a), /(*, i ), and f(t, f). Hence, the point 
f(a,t ) must be the intersection of the tangents to the parabola F(u) at u = s and 
u = i, as shown (without the bars) in Fig. 1.1. As a consequence, the range of the 
blossom / is precisely the parabola itself together with its exterior. Given any point 
x exterior to the parabola, we can compute the unique 8 and i with the property 
that x = f(a,i) by finding the two tangents to the parabola that pass through x. 
This type of geometric intuition is a valuable adjunct to the algebraic techniques 
that we used in proving the Blossoming Principle. As a sample of the algebraic 
approach, note that the blossom / of the standard parabola F(u) ~ (u,u 2 ) is given 
by /(fl,f0 = ((tt + v)/2,tw>. 

Surfaces aren't much harder than curves. Recall that a polynomial surface is 
simply a polynomial function F:P Q whose domain P is two-dimensional. The 
blossom of a surface F is a symmetric, multiaffine function f:P n -► Q. Fig. 3.4 
shows an application of the de Casteljau Algorithm computing a point on a quadratic 
surface from the six Bezier points of that surface. The surface F itself, which is not 
shown, is the locus of all points of the form /(u,u) as u varies over the parameter 
plane P. The image under F of the domain triangle Arst is a triangular surface 
patch with vertices /(r,r), /(s,s), and /(t,t) and with parabolic edge curves. (De- 
generacies are even harder to avoid for surfaces than for curves; since a quadratic 
surface F has six Bezier points, we would need five dimensions to draw one that is 
nondegenerate.) 

Proposition 3.5. Let Arst be a reference triangle in the two-dimensional affine 
space P. A symmetric, n-affine function f :P n -► Q is completely determined by 
its values on argument bags (that is, multisets) that consist entirely of t's, s's, and 
t 's. In particular, the formula 
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provides a one-to-one correspondence between symmetric, n-affine functions f and 
triangul&r arrays of ( n + 2 ) points (x»y fc ) in Q> where t, j, and k are nonnegative 
integers that sum to n. 

Proof. In our discussion of curves, we used the two endpoints of the reference 
segment [l,t] as an affine frame for L. (A frame for an affine space is the affine 
analog of a basis for a linear space.) Analogously, we shall use the three vertices 
r, s, and t of the reference triangle to form an affine frame for the plane P. Every 
point u in P can be written uniquely as an affine combination of the points in the 
frame, in the form 

u = /? r (u)r + £.(u)s + /? t (u)t. 

The real coefficients #r(u), #,(u), and /?t(u) in this expansion are called the bary cen- 
tric coordinates of the point u, and they sum to 1. 

Let f :P n — ► Q be a symmetric, n-affine function. If we expand the first argu- 
ment Ui of / in terms of r, s, and t, we deduce that /(ui, . . . ,u n ) is equal to 

£ r (ui)f (r,u 3 , . . . ,u n ) + ^ 

If we similarly expand each of the other arguments of /, we arrive at a sum for 
/(ill, • • .,u n ) that has 3 n terms: 

e n^( u <) n aw n M^)/(r 1 ^,» 1 ^, (^) 

/,J,JC any *€/ jeJ *€JC .ji i j. \ K \ 

partition of i i i i i i 

{1.3, .-.,»} 

This formula expresses every value of / in terms of the ( w + 2 ) special values in 
which all of the arguments of / are either r's, s's, or t's. Furthermore, there are no 
constraints on these ( n £ 3 ) special values, since the formula 

/(m,...,u n ) := nprwUhM n Pt(^)x vnJm 

I,J,K any *€/ j€J h^K 

partition of 
{IX 

defines a symmetric, n-affine function whose special values satisfy Eq. (3.6). □ 

Proposition 3.8: B6zier triangular surface patches. IfF;P-+Qisa polyno- 
mial surface of degree n and Arst is a reference triangle in the parameter plane P } 
then the Bizier points by which the triangular surface patch F(Arst) is controlled 
in the standard theory are precisely the blossom values 

/(r,...,r,s,...,s,t,...,t). 



Proof When all of the arguments of the blossom /(ui, . . . ,u n ) are equal, the 3 r 
terms in Sum (3.7) collapse down as follows: 



/(u,...,u)= £ ( " Wu)<A(u)'A(u)*/(r. 
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Comparing this formula with the standard expansion of a Bezier surface in terms of 
its Bezier points [5] completes the proof. □ 

The case where the domain space P has arbitrary dimension p is completely 
analogous, although it is such a notational challenge that we won't bother to go 
through it in detail. One begins by choosing a reference p-simplex [r 0> ri,. . . ,r p ], 
whose vertices provide an affine frame for P. A polynomial function F:P -+Q then 
has ( n + p ) Bezier points, each of which is the result of choosing, with replacement, n 
points from among the p+ 1 simplex vertices and evaluating the blossom / on those 
n points. Every value of the blossom can be expressed as an affine combination of 
these Bezier points by means of a sum that involves (p + l) n terms. 

4. The Tensoring Principle 

Blossoming is quite a useful trick; this was suggested by the figures in Section 1 
and will be borne out in more detail in what follows. But there are some problems 
with blossoming as we know it so far. To begin with, there is a notational hassle: 
we had to resort to horizontal braces in order to write down the Arth Bezier point 

n-k k 

of an n-ic curve F : L -+ Q as a value of the blossom /. It would be more convenient 
if the arguments to a blossom were multiplied, to form a product, rather than con- 
catenated, to form a sequence. Then we could write the Arth Bezier point something 
like w /(* using exponential notation. It turns out that there is another 

important principle of mathematics, called the tensor-product construction, that 
can be used to convert an old-style, multiaffine blossom / into a new-style, affine 
blossom whose argument is a product of n points. More precisely, an affine 
blossom takes an n-tensor as its argument, where a point is a 1-tensor. Our goal 
in this section is to apply the tensor-product construction to convert our blossoms 
from multiaffine maps to affine maps, thus reaping both notational and conceptual 
benefits. 

Warning 4.1. Tensoring is not as hard as most people think, but it is harder than 
blossoming — it demands some tolerance for abstraction. If you get confused in the 
next couple of sections, there are two things that you might try. One is to jump to 
Section 7, and then to Sections 14 and following, treating expressions of the form 
/®(u x • • -u n ) as the author's funny way of writing the blossom value f(u u . . . ,u n ). 
Another alternative is to give up on this paper and switch to the Reader's Digest 
version [35], in which I describe blossoming as clearly as I could in twenty pages, 
with no tensors. 

More formally, the goal of our tensoring work is as follows: Given an affine space 
P and a degree n, we want to construct an affine space P® n , called the n-th symmet- 
ric tensor power of P. The interesting property of the tensor power space P® n is that 
symmetric, multiaffine maps f:P n -> Q (such as old-style blossoms) are equivalent 
to affine maps /® : p®» Q (new-style blossoms). In particular, the correspon- 
dence between / and /• is reflected in the identity /(u x ,. . . ,u n ) = /®(u x -u n ), 
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where, on the right-hand side, we are using the tensor product operation to build 
up an n-tensor in P® n out of points, which are 1-tensors in P = P® 1 . 

Different variants of tensoring show up in different places in mathematics. In 
each case, the effect of the tensor-product construction is to replace multilinearity 
with simple linearity, at the price of complicating the domain space. In linear alge- 
bra, for example, an asymmetric, linear variant of the tensor-product construction 
is used to replace a bilinear map whose domain is a Cartesian product V x W of 
linear spaces with a linear map whose domain is the tensor product V ® W. 

The variant of tensoring that we need is symmetric and affine, instead of asym- 
metric and linear. The affine versus linear issue doesn't make much difference, but 
the symmetry condition in our variant simplifies things in important ways. First, 
it means that we will be computing the tensor powers P® n of a single space P, 
rather than the tensor products of different spaces. Second, the product operation 
that builds n-tensors in P® n out of points in P = P® 1 in the symmetric case is 
commutative. Therefore, we will use simple multiplication, rather than the more 
standard "®", to denote that product, writing an n-tensor as Ui * • • u n , rather than 
as Ui ® • • • ® u n . Third, in the symmetric case, all of the tensors on a space P, under 
addition, scalar multiplication, and tensor product, form an algebra that is isomor- 
phic to a very familiar algebra: the algebra of polynomials in p + 1 independent 
variables, where p = dim(P). See, for example, Prop. 8.1 in Chapter XVI of the 
second edition of Lang's Algebra [34]. We will take advantage of this isomorphism 
by using certain spaces of formal polynomials as the raw material out of which we 
will build our tensor power spaces. 

Warning: "Tensor-product surfaces" in computer-aided geometric design have 
"tensor" in their name because of an application of the more common, asymmetric 
variant of the Tensoring Principle. In Section 12, we will finally get around to 
considering tensor-product surfaces and to studying the relationship between that 
variant of the Tensoring Principle and the symmetric variant that we are about to 
develop and apply. 

It's time for some mathematics. We know that the tensor power space P® n 
should include elements that correspond to the products of n points in P. One way 
to build such a space is to treat the points in P as formal symbols and to allow 
ourselves to multiply them formally. If we do precisely that, we end up with a 
technique that applies, not to symmetric, multiaffine maps, but rather to arbitrary 
symmetric maps. 

Let S be any set, let Q be an affine space, and let h:S n -> Q be an arbi- 
trary symmetric function. For each element 8 of 5, we shall invent a brand-new 
formal variable, written X 0 . An X -polynomial on S is a polynomial in these new 
X -variables. Let X[S] denote the algebra consisting of all X-polynomials on S 
(with coefficients in the real numbers R). There are as many X- variables as there 
are points in 5, possibly infinitely many; but any particular X-polynomial $ in 
X[S] can only involve a finite number of the X-variables. 

One simple type of X-polynomial is an X-monomial of degree n, that is, an 
expression of the form X 9l • • • X 9n , where the 84 are elements of 5 . Since the function 
h is symmetric, we can, in some sense, evaluate h at such a monomial. To be more 
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precise, we can define a new function h x whose arguments are X-monomials by 
means of the formula h x (X Sl . . • X 8n ) := h(a u . . . , s n ). It is convenient to extend 
the domain of this new function h x to include affine combinations of X-monomials 
as well. For example, if n = 2, we would like equations such as 

, (*L . X*Xt\ = h x (X*) h x (X,X t ) _ h(r y r) h(s,t) 
\2 T 2 j 2^ 2 "2 + ~~2~ 

to hold. 

In order to be a legal argument to this extended function h x , an Jf-polynomial 
$ must be homogeneous of degree n. In addition, since the range Q of h is only 
an affine space, the coefficients of $ must sum to 1. To help formalize this lat- 
ter restriction, let us, just for the purposes of this paper, refer to the sum of the 
coefficients of an A"-polynomial $ as its flavor, written Flav($); the latter restric- 
tion is then Flav($) = 1. Note that we could characterize the flavor functional 
Flav:Jt[S]^R more abstractly as the unique algebra isomorphism with the prop- 
erty that Flavpf.) = 1 for all points s in S. Let X*[5] denote the affine sub- 
space of the algebra X[S] consisting of all polynomials that are ^-homogeneous and 
c-flavored. Then, the domain of our extended function h x is the affine space Xf [5]. 

We can summarize the results of the last few paragraphs as the following easy 
proposition. 

Proposition 4.2: The X-Polynomial Construction. If S is any set and Q is 
any Affine space, symmetric functions h:S n Q are equivalent to affine functions 
h x :X?[S] -> Q. In particular, given a function of either type, a unique function of 
the other type exists that satis&es the identity h(si 9 . . . , s n ) = h x (X 01 • • • X 9n ). □ 

The X-Polynomial Construction is a (rather trivial) tool for analyzing sym- 
metric functions whose values lie in an affine space. Since an old-style, multiaffine 
blossom /: P n — ► Q is such a function, we can apply the X- Polynomial Construction 
to / to obtain a corresponding affine map f x :X?[P] Q. Unfortunately, we can't 
always go backward: if we start with an arbitrary affine map h x :X?[P] -* Q, the 
symmetric function h;P n -> Q that corresponds to h x under the Jf -Polynomial 
Construction probably won't be multiaffine, and hence won't be a blossom. For 
example, suppose that P is the affine line L, and consider an affine property of 
L, such as 1 = (0 + 2)/2. In order to make h:L n «► Q be multiaffine, the map 
h x :X?[L] — ► Q would have to map the X-polynomial Xj9 to the point in Q 
halfway between h x (X 0 $) and h x (X 7 $) } where $ denotes any X-polynomial in 
X" [£]. Since h x is affine, we are guaranteed that 

But there is no particular relationship, in the X-Polynomial Construction, between 
(X 0 + and X f . Indeed, X 0 , JCj, and X 5 are just three different formal 

variables out of the uncountable collection X P of formal variables. Intuitively, the 
X- Poly normal Construction doesn't handle blossoms properly because it ignores the 
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affine structure of the parameter space P. More formally, the affine space X"[P] is 
too big to serve as the tensor power space P® n . 

To get a better sense of where we stand, let us rephrase our blossom-based 
analysis of polynomial curves in Prop. 3.2 using the affine function fx instead of 
the blossom / itself, even though we already know that the domain -X"J*[P] of f x is 
not the tensor power space for which we are searching. In particular, let /: L n — ► Q 
be the blossom of an n-ic polynomial curve, and let fx>Xi[L] — ► Q be the affine 
function that corresponds to / under the X-Polynomial Construction. In proving 
Prop. 3.2, we expressed an arbitrary blossom value f(u\, * - • > fi n ) = fx(X Ql • • • X an ) 
as an affine combination of the Bezier points fx{X^~ k X^) > where [*, t ] is a reference 
segment in L and Jfc is in [0,n]. Converting to the use of fx throughout, our proof 
of Prop. 3.2 looks like this: 



fx(X ai - -X an ) = f x \ 

= fx\ 



,Jb n(^)n(^)*K 

/nJ=0 

e n(^)nte)/xW)- 



/UJ={1,2 n} »€/ v ' j€J 



Of this three-step derivation, the second step is simple algebra and the third step 
is justified because fx is affine. The first step is the subtle one. We are allowed to 
replace the factor X Gi in the argument to fx by the X-binomial 



only because the multiaffineness of / implies that fx has special properties. Once 
again, we have run across the problem that the X-Polynomial Construction ignores 
the affine structure of P. Our next task is to fix that problem. 

There are two ways to impose on the X-variables the affine structure of their 
subscripts, and hence to shrink the space X?[P] down to something that can serve 
as the nth tensor power of P. The simplest approach is to choose, once and for all, 
an affine frame [ro,...,r p ] for P and then to restrict ourselves to using only the 
p+ 1 distinguished X- variables X To through X Tp . Since the points ro through r p 
form a frame, there are no affine relationships among them, so there is no need to 
impose any affine structure on the corresponding frame variables X To through X Tp . 
In this approach, instead of writing X u for u in P, we must first expand u as an 

affine combination of the frame points, say u = f} 0 (u)To H h /? p (u)r p , and then 

write instead the corresponding affine combination of the frame variables. We can 
define a map p:Xp — ► Xj[r 0 , . . . ,r p ] that rewrites an arbitrary X-variable in terms 
of the frame variables by the formula 

p(X u ) := /? 0 (u)X ro + • • • + 0 P WXr p . 
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If $ is any X-polynomial on P, we can apply p to each X-variable in $ separately 
to rewrite $ completely in terms of the frame variables. We shall refer to the 
resulting map p:X[P] — ► X[ro, . . . ,r p ] as the normal-form map associated with the 
affine frame [r 0 , . . . , r p ] for P. Note that the normal form />($) has the same degree 
and the same flavor as furthermore, if $ is homogeneous, then p($) will be 
homogeneous also. Using p, we can summarize this first approach to tensor powers 
as follows: Symmetric, multiaffine maps f:P n — ► Q are equivalent to affine maps 
/jc:-X?[ro,...,r p ] — ► Q, Given a map of either type, a unique map of the other 
type exists that satisfies f(n u . . . ,u n ) = fx(p(X Ul • • • X Un )). 

One problem with this first approach is the need to apply the normal-form 
map p as part of the fundamental correspondence. But a more severe problem is 
the dependence of the entire approach on the choice of a particular affine frame for 
P. Note that the very definition of the tensor power space itself, X?[r 0 , . . . ,r p ], is 
dependent on the frame. Such a frame dependence would be particularly clumsy 
when we started working with splines, since we will often want to use a different 
frame for each piece into which the parameter space P is partitioned. Therefore, we 
shall adopt a more high-powered, second approach to tensor powers, one in which we 
shrink the space -X^P] down by defining an equivalence relation on ^-polynomials. 

The key idea behind this second approach comes from considering the rela- 
tion on X-polynomials determined by the equation p($i) = The map p 
depends upon a choice of an affine frame for P, of course. But the truth or falsity 
of the relation = is independent of the frame. We shall say that $i 
and $ 2 are affinely equivalent, written <& t ~ $ 2 , if the equation p($i) = p(*j) 
holds in one frame, and hence in all frames. We will use square brackets to denote 
equivalence classes with respect to affine equivalence; that is, [$] denotes the set 
of all JK-polynomials that are affinely equivalent to Note that, if $ t ~ $ 2 and 
*i ~ *2> then u^i + v*! ~ u$ 2 + v* 2 , and also $1*1 ~ Therefore, we can 
take linear combinations of equivalence classes or multiply equivalence classes and 
get well-defined results. 

Let X [P] denote the set of all affine equivalence classes of X-polynomials in 
-X"[P]. Under addition, scalar multiplication, and polynomial multiplication, this 
set becomes an algebra, which is called the symmetric tensor algebra of P [34]. 
The elements of this algebra are called tensors on P; that is, a tensor on P is an 
affine equivalence class [*] of X-polynomials on P. The multiplication operation on 
tensors [*] • [*] := is called the tensor product 

If [ro, . . . ,r p ] is any affine frame for P, we can use the associated normal-form 
map p to choose a canonical representative out of each equivalence class, that is, to 
choose a normal form for any tensor. In particular, the normal form of the tensor [$] 
is the X-polynomial p($). In fact, this correspondence is an algebra isomorphism 
between the tensor algebra X[P\ and the polynomial algebra X[r 0 , . . . ,r p ], a fact 
that is worth remembering. 

Proposition 4.3. Let [r 0 , . . . ,r p ] be any affine frame for P, and let p be the asso- 
ciated normal-form map. Then, the correspondence [$] »-+ p($) is an isomorphism 
between the symmetric tensor algebra X[P) of P and the algebra X[r 0i . . . ,r p ] of 
polynomials in the p+1 independent variables X ro through X Tp . □ 
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Corollary 4.4. The symmetric tensor algebra X [P] is a unique factorization do- 
main. □ 

A tensor [$] on P is called homogeneous of degree n, or n-homogeneous, or 
simply an n-tensor, if the affine equivalence class [$] contains some X-polynomial 
that is n-homogeneous. The X-polynomial $ itself may not be homogeneous. For 
example, over the line L, the tensor [$] where * = X^X^ + X 0 + X 2 - 2Xi is 
2-homogeneous, since + X% ~ 2X\ implies $ ~ X 3 X 4 . Extending the concept 
of flavor from X-polynomials to tensors is easier: all of the X-polynomials in an 
affine equivalence class [$] must have the same flavor, and we shall refer to that 
common value as the flavor of the tensor [*]. Let JC*[P] denote the affine subspace 
of the tensor algebra X[P] consisting of all c-flavored fc-tensors on P. We shall use 
Jiff [P] as our model of the nth symmetric tensor power P® n of P. That is, we will 
define P® n := Jlfr[P] to be the affine space consisting of all 1-flavored n-tensors. 
If we choose any affine frame for P, the correspondence of Prop. 4.3 will give an 
isomorphism between X?[P] and X? [r 0 , . . . ,r p ]. Thus, our second approach to 
tensor powers gets the same answer as the first approach did; but it does so in a 
frame-independent way. 

Tensors on P that are homogeneous of degree one are particularly simple; in 
fact, a 1-flavored 1-tensor on P is essentially the same thing as a point in P. To 
see this, note that every X-polynomial $ in -X"J[P] is simply an affine combination 
of X-variables. If we take the corresponding affine combination of the subscripts 
of those variables, we get a point u in P with the property that $ ~ X u . Thus, 
every equivalence class in X* [P] has the form [X u ] for a unique point u in P, 
and the first tensor power P® 1 = Xi[P] of P is actually isomorphic to P under the 
correspondence [-X" u ] »-* u. By an abuse of notation, we shall use this correspondence 
to identify P® 1 with P. This identification allows us to reinterpret the tensor- 
product operation as a multiplication on the points of P; that is, if u and v are 
points in P, and hence also 1-flavored 1-tensors on P, their product uv is the 
1-flavored 2-tensor [X U X V ]. 

It is at this point that we earn the reward for our earlier pedantic insistence on 
distinguishing between points u in the affine line L and scalars u in the coefficient 
field R. Note that the expression 2 3, being the product of two points in L, is a 
1-flavored 2-tensor on L. If we weren't distinguishing between points and scalars, 
we would have to use a special symbol to denote the tensor product operation on 
points in L or else we would risk confusing the tensor product 2 • 3 = 2 3 of the 
points 2 and 3 with the normal product 2 • 3 = 6 of the scalars 2 and 3. 

For n > 2, n-tensors are a little more subtle. For example, every product of 
n points Ui • • -u n is a 1-flavored n-tensor; but not every 1-flavored n-tensor can be 
written as the product of n points. As we shall study in Section 9, the n-tensors that 
can be factored into a product of 1-tensors are called simple, while the rest are called 
compound — and compound n-tensors do exist for n > 2. While not every n-tensor 
is simple, the simple n-tensors do span the tensor power space P® n = Jfi n [P]; 
this follows from the obvious fact that every X-polynomial in X?[P] is an affine 
combination of X-monomials of degree n. 
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Proposition 4.5: The Tensoring Principle, symmetric affine variant. Let 

P and Q be finite-dimensional affine spaces, and let P® n = [P] be the nth 
symmetric tensor power of P. Symmetric, n-affine maps f :P n — ► Q are equivalent 
to affine maps /® : P® n — > Q. In particular, given a map of either type, a unique map 
of the other type exists that satisfies the identity /(ui, . . . ,u n ) = /®(ui • • -u n ). 

Proof. Suppose that we start with a symmetric, multiaffine map f:P n — ► Q. The 
X- Polynomial Construction gives us an affine map f x :Xf[P] — ► Q that satisfies 
/(ui, . . . ,u n ) = fx(X Ul • • -X Uf J. Since / is multiaffine, the map f x will have the 
additional property that, if two X-polynomials $i and $ 2 in Xf[P] are affinely 
equivalent, then fx($x) = fx($2). Hence, we can define a map /®: X?[P] — ► Q by 
the formula /•([#]) := /*(*). It follows that - ..u n ) = Jf n J) = 

fx{X Ul •••JC Ub ) = /(ui, . . . ,u n ). The map /® is affine because fx is affine. Fi- 
nally, the map /® is unique because the correspondence with / determines the 
values of /® on all simple 1-flavored n- tensors, and those simple tensors span the 
domain of /®. 

The same procedure works in reverse. Suppose that we start with an affine 
map X?[P] -> Q. We can define an affine map f x .X?[P] -> Q by the formula 
fx(®) := /*([*])• The X-Polynomial Construction then gives us a symmetric 
map f:P» - Q that satisfies /(m, . . . ,u n ) = /x(X Ul ~-X Um ) = /•(ui---u n ). 
The map / will be multiaffine because fx is constant on affine equivalence classes. 
Finally, the correspondence with /® determines every value of /, and hence the map 
/ is certainly unique. □ 

Corollary 4.6, An affine map /® : P® n — ► Q is completely determined by its values 
on the n-tensors u n that are perfect nth powers of points in P. 

Proof This follows at once from the Tensoring Principle and Cor. 2.2. □ 

If F: P — ► Q is a polynomial map of degree n, we now have two different 
blossoms for F: a symmetric, multiaffine map f :P n — ► Q that satisfies F(u) — 
/(u,...,u) and an affine map f® : p® n -> Q that satisfies F(u) = /®(u n ). We 
shall distinguish between these two blossoms by calling / the multiaffine blossom 
while calling /® the affine blossom. As a first step in understanding how these 
blossoms relate to each other, the following proposition considers their ranges. 

Proposition 4.7. Let F:P -+ Q be an n-ic polynomial function, let f :P n ->Q be 
the multiaffine blossom ofF, and let : P® n -► Q be the affine blossom ofF. Also, 
let the set B in Q consist of the Bezier points of F with respect to some reference 
simplex in P. Finally, let Span(S) denote the affine span of a subset S of Q. Then, 
we have 

Span(Range(P)) = Span(B) = Span(Range(/)) = Range(/®). 

Proof Since we can express P(u) as an affine combination of the Bezier points, 
we have Range(P) C Span(fl). Since each Bezier point is a value of the multi- 
affine blossom, we have B C Range(/). Finally, the fundamental correspondence 
/(ui,...,u n ) = /®(ui---u n ) shows that Range(/) C Range(/®). Putting these 
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facts together, we have 

Span(Range(P)) C Span(B) C Span(Range(/)) C Span(Range(/®)). 

Since /® is an affine map, its range is already a flat (that is, an affine subspace), so 
Span(Range(/®)) = Range(/®). We will be done if we can show that Range(/®) C 
Span(Range(P)). 

To show this, it is enough to show that /®(ui---u n ) is in Span(Range(P)) 
whenever the u» are points in P, since the simple n-tensors ui • • • u n span the domain 
P® n of /® . That is, it is enough to show that /(ui, . . . ,u n ) is in Span(Range(P)). 
This is demonstrated in an explicit way by Eq. (2.3), 

/(ni,ii„...,n«) = ^ £ (-)»-**"*(£ 5>\ (2.3) 

SC{l,2,...,n} V %eS J 

k=\S\ 

which actually expresses the blossom value /(ui, . . . ,u n ) as an affine combination 
of values of F. There is only one difficulty: we didn't prove Eq. (2.3) — we only 
stated it. We can prove Eq. (2.3) by proving the following equality between tensors: 

k=\S\ 

Eq. (2.3) follows from Eq. (4.8) by applying /® to both sides. 

Proving Eq. (4.8) is a straightforward exercise. Working on the right-hand 
side, we first cancel the k n in the numerator and denominator, and then apply the 
Multinomial Theorem to expand the nth power, getting 

a £ (->"-'*' £ Gx"J u? 

SC{l,...,n} A+-+i.=» Ul Jn/ 

j;>o=>*es 

Interchanging the order of summation, we have 

h £ G,"^)"'-"- S (-)- |s| - 

* + Ul Jn/ SC{l ( ...,n} 

j<>0=>t€5 

If there is any t such that j\ = 0, the inner sum will give zero, since we can pair off 
sets S that do and don't contain i. In order for all of the to be positive, they must 
all be 1, which forces S = {1, . . . , n}, so the right-hand side simplifies to u% • • • u n , 
and we are done. □ 

Turning from ranges to domains, the domain of the affine blossom /® is the 
tensor power space P® n , which is a rather complicated thing. Whenever we want 
to know something about that space, however, we can choose an affine frame for P 
and then use the isomorphism of Prop. 4.3 to convert our question about P® n into 
a question about X-polynomials. For example, suppose that we wanted to know the 
dimension of the tensor power space P® n . 
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Proposition 4.9. If P is a p-dimensional affine space, the nth symmetric tensor 
power P® n of P is an affine space of dimension ( n + p ) - 1. 

Proof. Choose an affine frame [r 0 , . . . ,r p ] for P. The normal-form correspondence 
of Prop. 4.3 then provides an isomorphism [$] i-> between the nth tensor power 
p®n = jfnjpj of p and the space x?[r 0 , . . . ,r p ] of 1-flavored X-polynomials that 
are n-homogeneous in the frame variables X ro through X Tp . The latter space has 
an obvious affine frame consisting of the ( n + p ) different X-monomials of the form 

X rH ' ' ' X *Z wh ere £)L 0 h = n. The dimension of the space XJ*[r 0 , . . . ,r p ], and 
hence also the dimension of P® n , is just one less than the number of points in this 
frame. □ 

Two closing remarks: First, the Tensoring Principle is frequently applied in 
mathematical areas where a duality is present. In such areas, tensors come in three 
varieties: the purely primal ones, which are called contravariant, the purely dual 
ones, which are called covariant, and the impure ones, which are called mixed. (See 
Dodson and Poston [18] for the sad story of how the primal concept got the name 
"contravariant.") In this paper, we are dealing exclusively with points and other 
primal objects — there is no duality in sight. In particular, all of our tensors are 
contravariant; normally, we won't bother to mention this fact. Second, Prop. 4.5 
deals with the symmetric affine variant of the Tensoring Principle. Before we are 
done, we will find good uses for four different variants: symmetric and asymmetric, 
affine and linear. It is worth noting that there is a fifth variant, which we won't 
have any use for in this paper: the antisymmetric or alternating linear variant. 
This variant arises in the study of multilinear functions /(ei,...,e n ) that change 
sign whenever any two arguments are interchanged. 

5. The Homogenizing Principle 

So far, we have been working in the affine world. That is appropriate, since 
we started out trying to understand the nonhomogeneous polynomial curves and 
surfaces that are used in computer-aided geometric design. But we won't be able to 
stay in the affine world forever: the derivative F* of a polynomial curve F:L — ► Q 
has values that are vectors on Q y rather than points in Q, and the set of all vectors 
on Q forms a linear space rather than an affine space. To prepare for such forays 
into the linear world, we shall devote this section to our third (and last) important 
mathematical principle, the Homogenizing Principle, which converts from the affine 
world to the linear world at the price of adding one more dimension to the spaces 
involved. With all three principles at our disposal, we will be able to consider any 
polynomial map F in six different guises, as shown in Fig. 5.1. 

The Homogenizing Principle is quite common, both in mathematics and in 
computer-aided geometric design. For example, the usual way of representing affine 
transformations of 3-space in computer-aided geometric design is to use 4-by-4 ma- 
trices whose last column is the unit vector (0,0,0,1). Converting an affine map 
between 3-spaces into a restricted type of linear map between 4-spaces is a typical 
application of the Homogenizing Principle. 

The first step on the road to the Homogenizing Principle is to linearize the 



26 



THE THREE PRINCIPLES 



5.1 



Blossoming 



Tensoring 



Polynomial map 


Homogeneous polynomial map 


F:P-~>Q 


P*:P* 


polynomial of degree n 


homogeneous polynomial of degree n 




flavor-exponentiating 


Multiaffine blossom 


Multilinear blossom 


f:P n ^Q 


/*:(P*) n -Q* 


symmetric, multiaffine 


symmetric, multilinear 




flavor-multiplicative 


Afflne blossom 


Linear blossom 




ff:P?"-+Q* 


affine 


linear 




flavor-preserving 



Homogenizing 

Fig. 5.1 The six guises of a polynomial map 

affine spaces involved, that is, to imbed them as hyperplanes in linear spaces of one 
higher dimension. It turns out that we already did this as part of our work on the 
Tensoring Principle. Given an affine space P, we constructed the tensor algebra 
X{P}. In this algebra, consider the set X*[P] of all 1-tensors of all flavors. This 
set Xl [P] is a linear space, which we shall henceforth abbreviate as P* and refer 
to as the linearization of P. (Comment for nit-pickers: The origin of the linear 
space P* is the zero tensor [0] on P, which we include in the set of 1-tensors Xl [P] 
by convention, even though the degree of the zero polynomial is not well-defined. 
More precisely, we make the convention that the zero polynomial is homogeneous 
of degree k for every A:.) We shall also define P c := X}[P] to be the affine space 
consisting of all c-flavored 1-tensors on P, for any real number c. The spaces P c are 
parallel hyperplanes in the linearization P* of P. Since we are identifying points 
in P with 1-flavored 1-tensors on P, the space P = P x itself is a hyperplane in its 
linearization P*. 

The hyperplane parallel to P that goes through the origin of P* is Po, the set 
of all 0-flavored 1-tensors on P. Note that Po is actually a linear space, since sums 
or scalar multiples of 0-flavored tensors are 0-flavored. The elements of Po are called 
vectors on P, or, more precisely, free vectors on P. Note that the difference £ = v— u 
of two points in P is a vector on P, since the difference of two 1-flavored 1-tensors 
is a 0-flavored 1-tensor. The freedom in the term "free vector" is the freedom to 
slide the arrow from one place to another in P, as long as its length and direction 
are preserved; that is, if £ = v - u, then we also have £ = (v + 1)) - (u + r/) for any 
vector rf on P. 

The common types of coordinate systems on an affine space P extend easily 
to coordinate systems on its linearization P*. The two cases of interest to us are 
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Cartesian coordinate systems and barycentric coordinate systems. 

To form a Cartesian coordinate system for an affine space P, we choose an 
origin point o in P and a basis of vectors 6 Xy . . . , S p for the linear space P 0 . Then, 
each point u in P can be expressed uniquely in the form 

p 

1=1 

Note that the 1-tensors {o,$i, . . . ,6 p } are linearly independent, when viewed as 
elements of the linearization P*. Hence, we can get a coordinate system on P* 
by using those p + 1 tensors as a basis. The effect of doing so is to extend the 
coordinate system from P to P* by adding one new coordinate that measures flavor. 
In particular, an arbitrary 1-tensor c in P* can be expressed uniquely in the form 

e=( C 1 ,...,^;e 0 )=e°o + ^e^, 

*=i 

where e° = Flav(e). In order to follow the convention that is standard in computer- 
aided geometric design, we shall write the flavor coordinate c° last inside the angle 
brackets, even though it is numbered zero. 

Barycentric coordinate systems are even easier to extend. A barycentric coor- 
dinate system for the affine space P begins with an affine frame [r 0 , . . . ,r p ] for P. 
Each point u in P can be written uniquely as an affine combination of the frame 
points, with the barycentric coordinates as the coefficients: 

p p 
u = <u°, . . . , u p ) = ]P u*r«, where u 4 1 - 1. 

*=o »=o 

To extend such a coordinate system to the linearization P* of P, we just drop the 
restriction about the coordinates summing to 1. That is, using the same frame 
points Ti as a linear basis for P*, an arbitrary 1-tensor e in P* can be expressed 
uniquely in the form 

i=0 

The flavor of the 1-tensor e is given by the sum Flav(e) = 53?=o c * °* lts barycentric 
coordinates. 

The Homogenizing Principle comes in many variants. The simplest variant is 
the one that converts an affine map into a linear map. 

Proposition 5.2: The Homogenizing Principle, linear variant. If P and Q 
are finite-dimensional affine spaces and P* and Q* are their linearizations^ affine 
maps h: P — ► Q are equivalent to Savor-preserving linear maps A*:P* -* Q*. That 
is, given a map of either type, a unique map of the other type exists that satisSes 
the identity h(u) = h*(u) for all points u in P. 
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Proof. One direction is trivial: If /i*:P* — ► Q* is linear, then the restriction of A* 
to the hyperplane P in P* will be an affine map h: P — ► Q*. If h* is also flavor- 
preserving, it must map 1-flavored tensors to 1-flavored tensors, so we will have 

h(P) c q. 

In the reverse direction, suppose that h:P — > Q is affine, and let [ro,...,r p ] 
be an affine frame for P. The points r t of the frame, interpreted as 1-tensors in 
P*, form a linear basis for P*. Hence, the p + 1 conditions h*(ri) := hfa) serve to 
uniquely define a linear map /i*:P* — ► <?*. The restriction of Ji* to P will agree 
with h on P because both maps are affine and they agree on an affine frame for P. 
Furthermore, the map h* preserves the flavor of each basis element r», since both 
Ti and /t*(r<) = /i(r») are 1-flavored; since ft* is linear, it must preserve flavors in 
general. □ 

If we apply Prop. 5.2 to the affine blossom /®: P® n Q of a polynomial map 
P:P — > Q, we can convert that affine blossom into the linear blossom of F, which 
is a flavor-preserving linear map f£:P* n -* <?*, as shown in the lower-right box in 
Fig. 5.1. The intuitive idea is that, while the affine blossom /® demands a 1-flavored 
n- tensor on P as its argument and returns a 1-flavored 1-tensor on Q as its result, 
the linear blossom /® will accept an n- tensor on P of any flavor as its argument, 
and will return as its result a 1-tensor on Q of that same flavor. 

Strictly speaking, there are three different ways to interpret the expression Pf n , 
which is used in Fig. 5.1 to denote the domain of the linear blossom ff . Fortunately, 
the three interpretations give canonically isomorphic results. The simplest choice 
is to interpret Pf n as JC* [P], the set of all n- tensors of any flavor on P. That 
was the interpretation that we assumed in the last paragraph. When we apply 
Prop. 5.2 to the affine blossom /®, however, the domain space of the resulting 
linear blossom ff is actually (P® n )*, which, by our definitions, is the iterated 
tensor space (P® n )* = (^"[P])* = i'*[^i n [^]]- The distinction arises here because 
we have chosen a particular linear space, constructed in a particular way, to be the 
linearization P* of an affine space P. If we were being more abstract, we would define 
a linearization of P to be a triple (W 9 h t <p) where W is a linear space, h:P — ► W 
is a one-to-one affine map, and <p:W — ► R is a linear flavor functional, satisfying 
h(P) = y> _1 (l). It is straightforward to check that all linearizations W of a space P 
are canonically isomorphic. In particular, the two linearizations P® n and (P® n )* 
of P® n are isomorphic. 

The third choice is to interpret the expression P® n to mean (P*)® n , that 
is, the nth linear symmetric tensor power of the linearization of P. This is the 
interpretation that arises when using the linear variant of the Tensoring Principle 
to move down from the middle-right to the lower-right box in Fig. 5.1. We haven't 
studied how to compute tensor powers in the linear world, but, if we did, we would 
find that the space (P*)® n is also a linearization of the affine symmetric tensor 
power P® n . Thus, all three ways of parsing the domain of the linear blossom give 
the same result. 

Exercise 5.3: The Tensoring Principle, symmetric linear variant. Develop 
the theory of symmetric tensor powers in the linear world by substituting linear 
notions for affine ones in the theory of Section 4, while removing any notion of 
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flavor. Let W be a linear space. For each element w ofB^, let V w be a formal 
variable, and let Y[W] be the resulting algebra of y-polynomials. Define what it 
means for two ^-polynomials to be linearly equivalent, and let y[W] be the resulting 
algebra of equivalence classes. Prove that symmetric, multilinear maps with domain 
W n are equivalent to linear maps with domain W® n := l/ n [W], 

There are other variants of the Homogenizing Principle that can move us from 
the affine world to the linear world at either the middle or the top level of Fig. 5.1. 
In each case, the homogenized map in the linear world has a property related to fla- 
vor whose effect is to guarantee that 1-flavored arguments are carried to 1-flavored 
results. The multilinear blossom /*:(P*) n -> of F is flavor-multiplicative, 
meaning that Flav(/*(e!, . . . ,e n )) = flLi Flav(e t ). The homogeneous polynomial 
form P*:P* — ► of the polynomial map F is flavor- exponentiating , meaning that 
Flav(P*(e)) = (Flav(c)) n . 

Exercise 5.4. If P is an affine space, show that the two spaces (P n )* and (P*) n 
are quite different. Hence, the parentheses used above in describing the domain 
(P*) n of the multilinear blossom /* cannot be omitted. 

We can get a more concrete perspective on our three principles by watching 
them in action on an explicit polynomial map, given in terms of coordinates. In 
particular, let us write down all six guises of the cubic polynomial curve F:L — ♦ Q, 
sitting in a 3-space Q 9 that is given in Cartesian coordinates for L and Q by F(u) = 
F((u)) = (X(u),Y(u) y Z(u)) where 

X(u) := 7u 3 + 6u 2 - 3u + 4 

Y(u) := lu 8 + 3u 3 + 9u-5 (5.5) 
Z(u) := 2u 3 - 3u 2 + 12u - 8. 

Moving down the left column of Fig. 5.1, the multiaffine blossom f:L s — ► Q 
of F will be the map f(Q u u%,Qz) = (x(u u u 2j us) } y(u 1} U2yUs)yZ(u u U2 } us)) given 
by 

x(ui,u 2 ,ii 3 ) := 7uiu 2 U3 + 2uiu 2 + 2uiu s + 2u 2 u s - lu t - lu 2 - lu 8 + 4 
y(«i,«2,«3) := ltiiu 2 us + luiu 2 + luius + lujus + 3u! + 3u 2 + 3u 8 - 5 
z(ui,u 2 ,u 3 ) := 2u 1 u 2 u 3 - lu x u 2 - luiu s - lti 2 u s + 4u 1 + 4u 2 + 4u s -8. 

In general, as we saw in Section 2, blossoming a curve takes a term of the form cu k 
in a polynomial of degree n and turns it into (£) terms, each with coefficient c/(£). 

Continuing down the left column of Fig. 5.1, we next want to write a formula for 
the affine blossom /® . In order to do so, we must first choose a coordinate system for 
the third tensor power L® 3 of the affine line L. Let 5 = 1-0 denote the unit vector 
on L. In this context, the natural coordinate system to use for Z/® s is the one that 
expresses a 1-flavored 3-tensor e in the form e = (a,6,c) = aS z + bS 2 0 + cSO 2 + 0 s . 
We can convert any tensor e in L® 3 into this coordinate system by making use of 
the formula u = uS + 6; for example, the product 0iu 2 u 3 of three points becomes 

«iu 2 u s = uiu 2 u 3 £ 3 + (uxu 2 + uiu 3 + u 2 u 3 )£ 2 0 + (ui + u 2 + u s )60 2 + 0 s . 
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Comparing this expansion to our formula for the multiaffine blossom /, we deduce 
that the affine blossom /® is given by /®(c) = <x®(e),y®(e),z®(e)), where 

(a8 z + b6 2 0 + cSO 2 + 6 3 ) := 7a + 2b - lc + 4 
y®{a8 3 + b8 2 6 + cSO 2 + 0 3 ) := la + lb + 3c - 5 
z® (aS s + bS 2 0 + cSO 2 + 0 3 ) := 2a - 16 + 4c - 8. 

The trick of rewriting a tensor on L in terms of the basis {6,0} for L* is a 
generalization of Prop. 4.3 that is worth remembering. Recall that, in Prop. 4.3, 
we chose an affine frame [ro,...,r p ] for P and then rewrote any tensor [$] on P 
as an X-polynomial p(&) in the corresponding frame variables X ro through X Tp . 
Instead of starting with an affine frame for P, we could start, more generally, with 
any linear basis {&o> - • ■ >b p } for P*. In this more general setting, we don't get 
-Y-polynomials any more, because there are no X-variables associated with 1-tensors 
on P that aren't 1-flavored. But, if we treat the 1-tensors bo through b p themselves as 
variables, it is still the case that every tensor con? has a normal form /3(e) that is a 
polynomial in the variables bo through 6 p . This normal form correspondence /? is an 
algebra isomorphism between the tensor algebra X [P] and the algebra R[&o> • • • > b p ] 
of polynomials in the variables bo through b p over the reals. The one thing to be 
careful about in this more general context is computing the flavor of a normal- 
form polynomial /3(e) in R[6o, . . • , b p ]. So far, the flavor of any polynomial has been 
simply the sum of its coefficients, which corresponds to replacing each of its variables 
by 1 and then evaluating. To compute the flavor of a polynomial in R[6o> - • • , b p ]> 
however, we replace each variable 6^ by the flavor of the corresponding tensor 6» and 
then evaluate; this guarantees that Flav(£(e)) = Flav(e). 

Proposition 5.6. Let {bo, . • . , b p } be any line&r basis for the linearization P* of an 
affine space P, and let /3 be the associated normal-form map, winch rewrites each 
1-tensor in some expression for a tensor e on P as a linear combination of the basis 
tensors bo through b p . The normal-form map /? is an algebra isomorphism between 
the symmetric tensor algebra X [P] of P and the algebra R[6 0 , . . . , b p ] of polynomials 
in the p+ 1 independent variables bo through b p . □ 

The bases {6o> • • • > b p } that are useful in practice tend to have elements 6 t that 
are either points or vectors, that is, 1-tensors of flavor either 1 or 0. As we saw 
earlier, a barycentric coordinate system for P corresponds to a basis for P* that 
contains only points, while a Cartesian coordinate system for P corresponds to a 
basis for P* that contains one point and all the rest vectors. We shall refer to these 
types of bases for P* in the future as barycentric bases and Cartesian bases. 

We return to the study of the explicit cubic curve F:L — ► Q. In order to deal 
with the right-hand column in Fig. 5.1, we must extend our coordinate systems for 
L and Q to their linearizations L* and We shall do so by following our standard 
recipe for Cartesian bases: adding a flavor coordinate at the end. That is, we shall 
write a 1-tensor in L* as (u; r) = uS + rO, where r is its flavor, and we shall write 
a 1-tensor in Q* as (x,y, z\w), where w is the flavor. Note that we have u = (u; 1) 
and S = (1;0). With these conventions, we can construct the homogenized form F* 
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of F as follows: F*«u;r» = (X*(u;r) } Y*(u;r),Z^u;r)]W*{u;r)) where 

X*(u; r) := 7u 3 + 6u 2 r - 3ur 2 + 4r 3 
n(u;r) :=lu 3 + 3u 2 r+ 9ur 2 - 5r 3 

(u; r) := 2u 3 - 3u 2 r + 12ur 2 - 8r s (5 7) 

W*(u;r) :=0u 3 +0u 2 r + Our 2 + lr 3 . 

We first set the output flavor W*(u;r) to be the constant 1. Then, we homogenize 
all four output coordinates by adding as many factors of the input flavor r to each 
term as necessary in order to bring that term up to total degree 3. 

The multilinear blossom /* is constructed analogously. We have /*(ei, e 2 , e 3 ) = 
{x*(e 1 ,e2,e 3 ),y*(ei,e2,e 3 ),z*(c 1 ,e2,C3);t^*(ci ) e2 ) «3)) where e, = (u»;r») for t in 
[1,3] and the four functions x*, y*, z*, and w* are given respectively by the rows 
of the table 

7uiu 2 u 3 + 2u x u 2 r 3 + 2uir 2 u 3 + 2riu 2 u 3 - luir 2 r 3 - lfiu 2 r 3 - lrxr 2 u 3 + 4rir 2 r 3 
luiu 2 u 3 + lu x u 2 r s + luir 2 u 3 + lriu 2 u 3 + 3uir 2 r 3 + 3riu 2 r 3 + 3rir 2 u 3 - 5rir 2 r 3 
2u!U 2 u 3 - luiu 2 r 3 - lu!r 2 u 3 - lr x u 2 ti 3 + 4uir 2 r 3 + 4riu 2 r 3 + 4rir 2 u 3 - 8r!r 2 r 3 
0uiu 2 u 3 + 0uiu 2 r 3 + 0uir 2 u 3 + 0riu 2 u 3 + 0uir 2 r 3 + 0riu 2 r 3 + 0rir 2 u 3 + lr x r 2 r 3 

Our last challenge is to write a formula for the linear blossom f£. To construct 
the requisite coordinate system for L® 3 , we shall use the normal-form map (3 for 
tensors on_L associated with the basis {8,0} of L*: e = (a,6,c;rf) where 0(e) = 
aS s + b8 2 0 + c80 2 + <i0 3 . Using this coordinate system, we find that /f (e) = 
{^(e),y?(e),z®(e);^(e)), where 



xf(a8 3 + b8 2 0 + c80 2 + dd s ) 
yf(aS z + b8 2 Q + cSQ 2 + dQ*) 
zf{a8 s + b8 2 0 + c8Q 2 + d0 3 ) 
wf(a8 3 + b8 2 0 + c80 2 + d6 3 ) 



= 7a + 2b - lc + Ad 
= la + lb + 3c - 5d 
= 2a - 16 + 4c - 8d 
= 0a + 0b + 0c+ Id. 



(5.8) 



Note how the matrix of coefficients of F changes as we apply the various prin- 
ciples. Blossoming takes the column of coefficients of u k and divides it up into 
(2) equal parts. Tensoring eliminates the duplicate columns that blossoming intro- 
duced, by building in the symmetry condition. Homogenizing adds one more row, 
whose coefficients have a special unit form. 

One advantage of the homogenized approach is that it allows us to interpret 
the columns of coefficients as geometric objects. In our original formula for F in 
Eq. (5.5), the constant column (4, -5, -8) was obviously the point F(0) in Q. But, 
even though the other columns also had three components, they weren't points in 
any obvious sense. Jumping to the linear blossom in Eq. (5.8), we find that its 
four 4-element columns of coefficients, from left to right, are just the following four 
1-tensors on Q: f?(8 3 ), ff(8 2 G) y ff (50 2 ), and /,?(0 3 ). The first three are vec- 
tors on Q y since they are O-flavored, while the fourth is a point in Q. If we rescale 
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the argument tensor S k 0 n ~ k by (£), we arrive at a geometric interpretation for the 
columns of coefficients of the homogeneous polynomial map F* in Eq. (5.7). Com- 
paring Eq. (5.7) to the original function F in Eq. (5.5), we conclude the following. 

Proposition 5.9: The Power Basis for curves. Let L be the aJRne line, and let 
S be the unit vector on L. The n-tensors (fyO n ~ k 6 k for k in [0, n] form a linear basis, 
called the power basis, for the space Lf n . The coefficients of u k in the formulas for 
the coordinates of the point F(u) on a polynomial curve F: L — ► Q of degree n are 
precisely the non-flavor coordinates of the 1-tensor ff ((l)0 n ~ k 6 k ). □ 

Thus, specifying a polynomial curve F by giving the coordinates of F(u) as 
polynomials in u corresponds to specifying the linear blossom ff by giving the 
images of the tensors in the power basis. The Bezier technique for specifying a 
polynomial curve isn't very different; it just uses a different basis, as the following 
restatement of Prop. 3.3 tells us. 

Proposition 5.10: B4zier Bases for curves. Let [s,i] be a reference interval 
in the affine line L. Tien, the n-tensors § n ~ k i k for k in [0,n] form a linear basis, 
called a Bezier basis, for the space Lf n . The images of the Bezier basis tensors 
under the linear blossom ff are precisely the Bezier points of the curve segment 
F{[S,t]). □ 

One important difference between the power basis and a Bezier basis is that 
all of the n-tensors in the Bezier basis are 1-flavored. Hence, a B6zier basis for 
Lf n is also an affine frame for L® n . Because of this property, we shall sometimes 
say Bezier frame instead of Bezier basis. This also explains why the Bezier points 
f®(s n ~ k i k ) of a curve are values of the affine blossom /®, as well as being values 
of the linear blossom ff. 

These insights all generalize from curves to surfaces quite straightforwardly. Let 
P be an affine 2-space, and let (u, v) be a Cartesian coordinate system on P. Let o 
denote the point (0,0), let 6 U denote the unit vector on P in the u direction, and let 
S v denote the unit vector in the v direction; then, (u, v) — o + uS u + v6 v . The power 
basis for the space Pf n is the set of tensors of the form (. Jj^jo*, where t, j, 
and k are nonnegative integers that sum to n. If F : P — ► Q is a polynomial surface 
of degree n, the coefficients of u x v 3 in the formulas for the coordinates of the point 
F((u t v)) are precisely the non-flavor coordinates of the tensor ff ((^ " J^i^o*) • 
Alternatively, as described in Prop. 3.8, the Bezier basis for Pf n consists of the 
tensors r*s J t fc , where Arst is a reference triangle for P. 

6. The de Casteljau technique for specifying curves 

By means of blossoming, tensoring, and homogenizing, we have found that 
specifying a polyonomial curve F:L — ► Q of degree n is the same thing as specifying 
its linear blossom, which is a flavor-preserving linear function ff: Lf n -+ Q*. The 
obvious way to specify a linear function is to give its values on a basis for its 
domain. Using the power basis {(^)S n ~ k 0 k ) for the domain Lf n , we just learned, 
corresponds to the elementary approach of specifying the curve F by giving the 
coordinates of the point F(u) as polynomials in u. Using a Bezier basis {s n ~ k i k } 
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for Lf n corresponds to specifying the curve F by giving the Bezier points of the 
segment F([§, t ]). But any other basis for Lf n would do just as well. In this section, 
we shall investigate a class of bases that might be called de Casteljau bases, because 
they will enable us to specify a curve by specifying a collection of blossom values 
(poles, in de Casteljau's terminology [14]) of a fairly general form. As in the case of 
a Bezier basis, the n-tensors in a de Casteljau basis are always 1-flavored. Thus, a 
de Casteljau basis can also be viewed as an affine frame for £® n , and, in that role, 
will be called a de Casteljau frame. 

Let (fi, . . . , r 2n ) be a reference sequence of 2n points in the parameter space 
L. For k in [0,n], let e k denote the 1-flavored n-tensor e k = f k +i •••fjt +n on L\ 
that is, each e k is the tensor product of n consecutive points fi from the sequence 
(fx, ... , f 2ri ). The idea behind de Casteljau frames is to use the n + 1 tensors e k 
for k in [0,n] as an affine frame for L® n . Note that the Bezier frame with reference 
interval [s, t ] is a special case of a de Casteljau frame in which the first n points of 
the reference sequence, f x through r n , are all equal to s and the last n points, r n+1 
through f 2n , are all equal to t. Of course, not all reference sequences are legal; we 
have to put some constraints on the points F t * in order to guarantee that the tensors 
e k will be affinely independent, and will hence form a frame. 

Lemma 6.1: de Casteljau frames. Let (fi,. . . ,f 2n ) be a reference sequence of 
2n points in the affine line L. Then, the n-tensors e k := r k + % • • • f k + n for k in [0, n] 
form an affine frame for the tensor power space L® n if and only if we have ry ^ r n +» 
for 1 < t < j < n. 

Proof. (Note that the stated condition does hold in the case of a Bezier frame, since 
r i = 8 > f nH-« = t, and s < t.) 

We shall tackle the easier half of the proof first. Suppose that ry = r n +» for 
some t and j with 1 < t < j < n. We must show that the n-tensors e k for A: in 
[0, n] fail to be affinely independent. In fact, we shall show that there is an affine 
dependence among the j -i+2 tensors {e<_i, . . . , ey}. If we let q denote the common 
value of ry and r n + iy we can write out the tensors e<_i through cy as the rows of a 
parallelogram with constant columns: 

ri+i ... q fy +1 . . . r n +<_i 
f i+1 ... q fy +1 ... r n +i_i q 

q fy+i ... fn+t-1 q ... fn+y-i 

ry +1 ... F n+t _i q ... f n+ y_! f n+ y 

Note that each row includes all of the points fy+ x through f n +i_! as factors. In 
addition, each row also includes at least one q. Therefore, every row has the form 

Ui • • • uy_» q fy +1 • • • f n +i_x 

for some points u x through uy_». But, as we vary tii through tiy_» in L, the resulting 
tensors all lie in an affine flat of dimension (j - i) in L® n . Since the rows above 
denote j -i + 2 tensors that all lie in that flat, they must be affinely dependent. 
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In the other direction, suppose that ry ^ r n+ » for 1 < % < j < n. We must show 
that the tensors e k are affinely independent as elements of L® n , or, equivalently, 
that they are linearly independent as elements of the linearization Lf n . To test 
this latter condition, we shall expand the tensors e k into coordinates in terms of a 
known basis for Lf n and then evaluate a determinant. As our known basis, we will 
use the tensors Q n ~ l 8 l for / in [0,n], where 8 denotes the unit vector on L — that is, 
the tensors of the power basis, but without the binomial coefficient scaling factor. 

Expanding the tensor e k in terms of this known basis, we have 

e k = r k+1 • • • fjfc+ n = II (5 + r *+* 5 ) 

l<t<n 

0<l<n 

where a\ denotes the /th elementary symmetric function 

/c{i ( ...,n} iei 

l/l=« 

Therefore, the tensors e k will be independent if and only if det(M) ^ 0 where 
M = (m w ) is the (n + l)-by-(n + 1) matrix given by m kl = <ri(r k + u . . . ,r*+„) for 
k and / in [0,n]. 

To compute det(M), we shall use the same trick that is frequently used to 
compute the Vandermonde determinant. Since ^(rjfe+i, . . . , r fc+n ) is a homogeneous 
polynomial of degree / in the variables r», the quantity det(Af) must be given by a 
homogeneous polynomial of degree ( n J l ) in the variables r*. On the other hand, we 
already know ( n + l ) distinct linear factors of det(M), since we know, from the first 
half of this proof, that det(Af) = 0 whenever ry = r n +< for 1 < t < j < n. Hence, 
we must have 

det(M) = w J] (r n +< - ry) (6.2) 
i<»<i<« 

for some real constant w. To determine w, consider the coefficient of the term 
T := n i <i<n r n+i +1 - 1x1 the nght-hand-side of Eq. (6.2), the coefficient of T is 
clearly 1, since we must choose the r n +» term out of each binomial factor. In 
det(M), the only way to get the term T is to go down the main diagonal of M, 
taking the term r n +i • • • r n+fc from each m kk = <r k (r k +i,. . . , r fc+n ); hence, we again 
get a coefficient of 1, showing that w — 1. We conclude that the e k can be dependent 
only if at least one of the equalities ry = r n +» holds. □ 

Corollary 6.3: The de Casteljau technique for curves. Let ri, r 2n be 
reai numbers satisfying the condition that ry ^ r n+i for 1 < t < j < n. Polynomial 
curves F:L — ► Q of degree n are in one-to-one correspondence with (n + l)-tupies 
(x 0 , . . . ,x n ) of points in Q under the correspondence f{r k +i, . . . , f fc + n ) = x k for k 
in [0,n], where f denotes the multiafRne blossom of F. □ 

In the de Casteljau technique, the curve F is controlled by specifying the points 
ff{e k ) = /(f fc+ i, . . . , fjb+n)> which, following de Casteljau, we shall call poles. When 
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we study spline curves in Section 15, we will find that specifying curve segments by 
giving their poles with respect to reference sequences that are substrings of the knot 
sequence will make it easy to assemble those segments into a spline curve. In the 
context of spline curves, however, these poles have another name; they are known 
as the de Boor points of the spline curve [5], 

Challenge 6.4. Is the Tensoring Principle worth it? By using the Tensoring Prin- 
ciple in this section, we were able to reduce the study of de Casteljau frames to a 
straightforward problem in linear algebra: testing linear independence. As we shall 
find in Section 15, this means that we have already finished the hardest part of our 
development of the theory of spline curves. On the other hand, it took us quite a bit 
of work to construct the tensor power space L® n . Contrast the tensoring approach 
with more standard developments of the de Boor theory of spline curves. 



Part C: Perspectives on Blossoming 

What do the n different arguments to a blossom mean? Can we think of the 
blossom value /(tii,...,u n ) as the result of evaluating the n-ic curve F(u) at n 
different times? In the next three sections, we shall study this question from three 
perspectives: algorithmic, differential, and algebraic. 

7. The algorithmic view of a blossom's n arguments 

Prom an algorithmic perspective, the meaning of the original time parameter 
u of a curve F(u) is a choice of interpolation ratio. Given the Bezier points of 
the curve segment F([*,t]) and given the ratio of the lengths of the line segments 
into which the point u divides the segment [ff,f], the de Casteljau Algorithm tells 
us how to construct the point F(u) = /(u, . . . ,u) by performing n stages of linear 
interpolations with that ratio. 

It makes perfect sense, however, to use different ratios for the different stages 
of the de Casteljau Algorithm. If we use the ratio corresponding to u, during the 
tth stage, the point that we end up constructing is precisely the blossom value 
/({*!, . . . ,u n ). In more detail, the tth stage produces, as output, all of the blossom 
values whose argument bags consist of the t points tii through t^, filled out with some 
collection of (n - t ) copies of either 5 or i. Switching from the multiaffine blossom / 
to the affine blossom in order to avoid horizontal braces, the tth stage computes 
the point /® (tii • • • Uii n ~ i " k i k ) for k in [0, n - t] by using the recurrence 

/®(th (7.1) 

to interpolate between two adjacent outputs of the previous stage. For example, the 
left half of Fig. 7.2 shows the construction of the point /(4,3,2) = /(2,3,4) from 
the Bezier points of a cubic segment i^([0,6]) by interpolating with the ratios 2/3, 
1/2, and 1/3, in that order. 

The symmetry of the blossom / implies that using the same bag of ratios in 
a different order will result in the same final point, even though the intermediate 
points and lines will be different. For example, the right half of Fig. 7.2 uses the 
ratios in the order 1/3, 2/3, and 1/2. Thus, the algorithmic way to interpret the n 
arguments of a curve's blossom is that each of them provides the ratio to use during 
one of the n stages of the de Casteljau Algorithm. 

This insight works just as well for surfaces as it does for curves. If F: P — ► Q 
is a polynomial surface, any point u in P specifies a two-dimensional interpolation 
ratio. That is, the location of u with respect to the vertices of the reference triangle 
for P gives a way of choosing a corresponding point in the plane determined by any 
three points. Given the Bezier points of F, the de Casteljau Algorithm performs n 
stages of these two-dimensional interpolations in order to construct the point F(\i). 
Once again, it makes perfect sense to use different interpolation ratios during the 
different stages, and the final result does not depend on the order we choose. Ronald 
N. Goldman gets the credit for realizing that two-dimensional interpolation stages 
with different ratios commute with each other [25]. 
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Fig. 7.2. Different ratios in different stages of the de Casteljau Algorithm 

The same insight also applies to curves controlled with the de Casteljau tech- 
nique of Section 6, in which poles are specified instead of Bezier points. Recall that, 
in that technique, an n-ic curve F is controlled by specifying the blossom values 
/(ffc+i, . . • , ffc+ n ) for A: in [0, n], where (fi, . . . , r2 n ) is a sequence of 2n points in L 
satisfying the nondegeneracy conditions ry ^ r n +» for 1 < t < j < n. The output of 
the t th stage of the de Casteljau Algorithm in this case is the collection of blossom 
values /(u x , . . - , u», r k +i+ x , . . . , r fc + n ) for k in [0, n - •]. The t th stage produces this 
output by using the formula 

f(u ly . . . , fi,-, F*+,+i > . . . , f k+n ) = (7.3) 
/ r fc+nH -x - Uj \ _ 

I ; ; I ••ittt-i,r*+<,-.-,r Jb+n ) + 

/ tit - f fc+i \ 

i J /lwi,...,u,_i,rjb + » +1 ,...,r fc+n+1 J 

to interpolate between two adjacent outputs of the previous stage. Just as in the 
Bezier case, each blossom argument u« controls the interpolations that happen dur- 
ing one stage, and the arguments can be used in any order without altering the final 
result. We shall refer to the algorithm embodied in Eq. (7.3) as the Generalized de 
Casteljau Algorithm, to distinguish it from the special case in Eq. (7.1). When the 
Generalized de Casteljau Algorithm arises in the context of spline curves, however, 
we shall follow standard practice by referring to it as the de Boor Algorithm [5], 

Note that the different interpolations that make up the ith stage of the Gen- 
eralized de Casteljau Algorithm involve different ratios. In particular, the A;th in- 
terpolation of the tth stage is controlled by the location of ti t - with respect to the 
interval [r k + iy f k + n + x ). 

Note also that the nondegeneracy conditions on the u are exactly what is 
needed to guarantee that none of the ("J 1 ) denominators that arise when applying 
Eq. (7.3) is zero. This observation provides an alternative proof of Lemma 6.1 
that is computationally easier than our determinant-based proof in Section 6. If 
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Fig. 7.4. The Generalized de Casteljau Algorithm when n = 3 

r j r n+* for 1 < 1 < j < n, Eq. (7.3) with the /'s omitted shows that every 
simple n-tensor ui * • • u n can be expressed as an affine combination of the tensors 
€k — nt+i • • • Fjb+n for A: in [0, n]. Since the simple n- tensors span all of L® n , that is 
enough to conclude that the {e*} form an affine frame for L® n . The determinant- 
based proof in Section 6 is conceptually simpler, however, in the sense that it attacks 
a problem in linear algebra with the standard techniques of linear algebra, without 
recourse to an auxiliary algorithm. 

Fig. 7.4 shows the computation scheme of the Generalized de Casteljau Algo- 
rithm in the particular case n = 3 in the form of a triangular array of blossom values. 
The four blossom values across the top are the poles of a cubic polynomial curve 
F with respect to a reference sequence (fi,...^). The points in lower rows are 
computed as affine combinations of their two ancestors with the indicated weights, 
as Eq. (7.3) specifies. It is not at all obvious, from looking at this computation 
scheme, that it computes a function /(ui,u 2 ,u 3 ) that is symmetric. 

Just as the de Casteljau Algorithm can help us to understand blossoms, we 
can also use blossoms to help understand the de Casteljau Algorithm. For example, 
let F be a cubic curve, and suppose that we have used the de Casteljau Algorithm 
to construct the point F(v) = /(v,v,v) for some v in [0,1] from the Bezier points 
of the segment F([0 f I]). In the process of doing so, we will also have constructed 
the points /(0,0, v) and /(0,v,v). One of the important properties of the Bezier-de 
Casteljau theory is that the four points /(6,0,0), /(0,6,v), /(0,v,v), and f(v,v y v) 
are, in fact, the Bezier points of the initial segment ,F([0,t;]) of F([t), 1]). Viewed 
algorithmically, this corresponds to the fact that two quite different construction 
processes always give the same result: For any u in [0,v], one way to compute F{u) 
starts with the B6zier points of the entire curve segment F([0 y 1]) and uses the de 
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Fig. 7.5. Repeated subdivision of a cubic curve 

Casteljau Algorithm with ratio 0 : u : 1. Alternatively, we could start with the four 
purported Bezier points of the initial subsegment ^([O, v]) and use the de Casteljau 
Algorithm with ratio 0 : u : v. Thinking algorithmically, it isn't very clear why 
these two computations should produce the same answer. But, once we understand 
about blossoms, we can draw Fig. 7.5, which clarifies the situation. 

Exercise 7.6. There are two intersections of line segments in Fig. 7.5 that are not 
labeled as blossom values. Convince yourself that those two intersections arise only 
because the four Bezier points of the segment jP([0, 1]) are coplanar, causing the 
resulting cubic F to be degenerate. In fact, those two intersections are drawn in 
Fig. 7.5 so as to suggest that one of the lines is above the other in each case. If the 
first three Bezier points of F are assumed to lie in the plane of the paper, where 
should the fourth Bezier point /(1, 1, 1) be placed with respect to the plane of the 
paper to result in the crossings as drawn? 

Exercise 7.7. Consider quadratic polynomial curves, that is, parabolas. Note that 
the reference sequence (1,0,3,2) of length 4 satisfies the nondegeneracy conditions 
of the de Casteljau technique with n = 2. Hence, we can specify a parabola F 
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uniquely by giving arbitrary values to the three poles_/(l,0), /(0,3), and /(3,2). 
Sketch an example. Construct the blossom value /(1,2) from the three given poles 
in two different ways. What does this have to do with the well-known fact that the 
centroid of a triangle is two-thirds of the way from any vertex to the midpoint of 
the opposite side? 

8. The differential view of a blossom's n arguments 

Recall that, for a quadratic curve F(u) 9 the blossom value f{s,i) is simply the 
intersection of the tangent lines to the parabola F(u) at u = s and at u = t. For a 
second perspective on the meaning of the n arguments of a blossom, we would like 
to generalize this observation. In order to do so, we must study how differentiating 
interacts with blossoming. This is one area where the Homogenizing Principle is 
quite helpful. 

As a warmup, let F: L — > Q be a cubic polynomial curve. One possible correct 
formula for the derivative F'(u) in terms of the blossom / is 

F'(u) = 3(/(iTFT,fl,tt) - /(*,«,!*)). (8.1) 

Since / is affine in its first argument, there are lots of other ways to write this, 
including F'(u) = /(iT+3, u, u) - /(u, u, u) and F'{u) = 3 (f(v + 1, u, u) - /(v, u, u)) 
for any v in L. Note that, when we subtract one point from another point in an 
affine space such as Q> the result of the subtraction is a vector, not a point. Recall 
from Section 5 that, associated with any affine space Q } there is a linear space Qq of 
vectors on Q\ the spaces Qq and Q = Qi are parallel hyperplanes in the linearization 
of Q. The derivative of a curve F is a vector- valued map F': L — ► Qq. 
The easiest way to prove Eq. (8.1) is to invoke more of the technology of Sec- 
tion 5. We learned in that section that a polynomial curve F: L — ► Q of degree 
n is equivalent to a flavor-preserving linear map ff:Lf n — ► Q* under the corre- 
spondence F(u) = ff (ti n ). In particular, we can consider the curve F to be the 
composition of two functions: the nth-power map <p: L — ► L® w , which takes a point 
u to the 1-flavored n- tensor <p(u) := u n on L, followed by the linear map ff . Since 
linear maps commute with differentiation, we conclude that F'(u) = ff(<p'(u)). 
The derivative of <p is easy to compute using the Power Rule: 

v du du 
where 6 denotes the unit vector on L. Therefore, we have ^(o) = ff (nu n " 1 £) = 
nff (u n ~ l 6). To translate this answer back from the linear blossom ff to the 
multilinear blossom /*, we have to resort to a horizontal brace: 

F'^-nA^^^u). 

In order to translate back to the affine world, we must choose some way of wr iting 
the vector 8 as a difference of points. If we choose the substitution 6 := u + 1 — u, 
we get F'(u) = n(ff (iTF!*"" 1 ) - ff(u n ))> or equivalently, 

F'{Q) - n(/(SrTT,fi i: ^ 1 fl) - /(^vu)), 

n— 1 n 
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which is the generalization of Eq. (8.1) to arbitrary degrees n. 

The good news is that differentiating is quite easy in the linear world: just 
substitute the unit vector S for one of the copies of the argument point u. The bad 
news is the annoying factor of n. That factor of n arises because, when we follow the 
curve F(u), we vary all n of the arguments of the multiaffine blossom /(u, . . . , u) in 
parallel. Suppose that we held all but one argument of / fixed at u while we varied 
that single argument. The resulting function G(v) := /(tJ,u, . . . ,u) is affine, and 
its (constant) first derivative is precisely f?(u n ' 1 S) 9 without the factor of n. The 
effect, to first order, of varying all n arguments in parallel is to cause the resulting 
point to move n times faster. 

By repeating the argument above, we can compute derivatives of any order. 
The falling factorial power notation is helpful in this context: let denote the 
product n(n - 1) • • • (n - k + 1). 

Proposition 8.2. Let F;L->Q be an n-ic polynomial curve, and let ff \ Lf n 
Q* be the linear blossom of F. The kth derivative of F is given by the formula 

FM(u) = n±f?(u n - k 6 k ). □ 

Note that this formula gives the correct answer when k = 0, since = 1. For 
k > n, the factor of takes the right-hand side to 0, which is also correct. If we 
want to replace the linear blossom ff on the right-hand side with the multiaffine 
blossom /, we can just replace each factor of S by an equivalent difference of points, 
such as u + 1 - S. This leads to the formula 

* W (0) = nA £ 0) il 0) f 

o<y< fc V// > - ' > — * — ' 

which expresses the &th derivative of F as a Arth difference of values of the multiaffine 
blossom /. If we are interested in the endpoint derivatives F^(s) of the curve 
segment F([5,t|), we could use the substitution S := (i - a)/(t - s) instead, which 
leads to the well-known formula 



F<*>(*) = 




for an endpoint derivative as a fcth difference of the Bezier points. 

Prop, 8.2 gives us another way to prove Prop. 5.9, which related the coefficients 
of the polynomials that define a curve F(u) to the effect of the linear blossom /? 
on the tensors in the power basis for Lf n . From the theory of Taylor series, we 
know that the coefficient of u k in the polynomial expansion for ^(u) is the vector 
FW(0)/k\. Applying Prop. 8.2, we can rewrite that quantity as (n£/k\)ff (0 n " k S k ) > 
which is precisely the image of the Arth element (£)0 n ~*£* of the power basis under 
the linear blossom ff . 

In order to state the analogous formulas for surfaces, we have to agree on a 
notation for the derivatives of a function F:P^Q whose domain is 2-dimensional. 
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If u is a point in P, let dF u : Pq — ► Qo, given by 

dFM -£» 1 . 

denote the linear map that approximates the local behavior of F to first order near 
u. This map is variously called either the derivative of F at u or the differential 
of F at u. The quantity dF n (£) is also frequently written D^F(u) and, in those 
contexts, it is called the directional derivative of F in the £ direction at the point u. 
Which notation one uses doesn't make much difference; we will arbitrarily choose 
the former for now. The Jfcth derivative of F at the point u is a symmetric Ar-linear 
map d k F J1 :(Po) h — ► Q 0 , and a value of this map is written d k F u (£x, . . . , for 
vectors & on P. In the directional derivative notation, this same quantity would 
be written • • • D^ h F(u). To specialize these general notions to the particular 
case of curves, the convention is to use the unique unit vector S on L for every 
direction & that arises. Thus, for a curve F:L — ► Q> we have F'(u) = dF u (S) and 
FW(*) = d k F*{6 9 ... t S). 

Proposition 8.4: Derivatives in terms of blossoms. Let F:P — ► Q be a 
polynomial function of degree n, and let ff: Pf n be the linear blossom of F. 

The kth derivative of F is given by 

d k F u (ti,- • • , e*) = «V*®(u n - fc ei • • • □ 

One conclusion that we can draw from these formulas is that the more that we 
want to know about the behavior of a function F near a point, the more arguments to 
its multiaffine blossom we must be willing to vary away from that point. Computing 
a Jbth derivative of F at the point n involves varying k of the blossom arguments 
away from u. If we fix all of the blossom arguments at u, our knowledge of F is 
restricted to zeroth order, that is, to the value F(u) itself. If we can vary all n of the 
blossom arguments, we can compute F to nth order, that is, we know F completely. 

At this point, one must be careful to avoid falling into a tempting trap. Let's 
think about an n-ic polynomial curve F:L—>Q for simplicity. Let G: L — ► Q be the 
unique polynomial curve of degree k that agrees with F to kth order at the point 
f; the curves F and G are said to osculate to kth order at F. We can construct G 
by truncating the Taylor series of F: 

G(u) := F (f) + F'(r)(u - r) + . • • + ^^(« - r) k . 

It would be wonderfully simple if the A>affine blossom g of G could be obtained 
simply by fixing n — k of the arguments of the n-affine blossom / of F at the point 
r — that is, if the formula 

g(u u . . . , u fc ) = f(u u . . , , 0*, r^^r) 

n-k 

held. Unfortunately, that formula does not hold, except in the trivial cases k = 0 
and k — n. One way to see that it couldn't hold is to consider the case k = 1 and 
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Fig. 8.5. Two affine maps that osculate a parabola 

n = 2, when we are approximating a parabola F(u) by an affine function G f (u) for 
u near r.^ If the formula above were true, the point /(M ) would be both G f (s) and 
also G?(i ), that is, both the image of § under the best affine approximation to F at 
t and also the image of t under the best affine approximation to F at s. In reality, 
however, as shown in Fig. 8.5, the two points G f (s) and G 9 (i) do not coincide; 
instead, we have G f (s) = 2f(sJ) - /(*,*), while G 9 {t) = 2/(5,*) - /(*, s). The 
source of the problem is the annoying factor of n- in Prop. 8.2. 

To determine the relationship that really does hold between the blossoms of F 
and G, we shall translate from multiaffine blossoms to linear blossoms. In that lan- 
guage, the tempting falsehood in the last paragraph states that gf(e) = ff (ef n "*) 
for any tensor e in Lf k . The truth is that there is a bijective flavor-preserving linear 
map m:L$ k Lf k with the property that gf(e) = ff(m(e)r n ' k ); but the map 
m is not the identity. To compute the map m, let us use the tensors 8*r k ~ i for t 
in [0,*] as a basis for L$ k . From Prop. 8.2, we know that G<*)(r) = fcgf (£*>*"•). 
Similarly for F, we know that F^(f) = nifffrr n "»). Since G osculates F to Jfeth 
order at f , these two quantities are equal. Hence, we conclude that 

Let m: L®* Z,®* be the linear map whose action on the basis tensors is given by 
m(£»r := (ni/fc*)$V by the linearity of the linear blossoms ff and gf , we 
may conclude that (c) = ff (m(e)f for all c in L$ k . 

All of this theory works perfectly well for surfaces as well as for curves. The 
surface G of degree k that osculates a surface F: P — ► Q of degree n to Jfcth order 
near a point r is abstractly defined by truncating the Taylor series: 

G(u) := F(r) + dF r (u - r) + • ■ • + u -r) ^ 

By the same reasoning as above, using Prop. 8.4 this time, we can conclude that 
there is a flavor-preserving linear bijection m: Pf k P$ k with the property that 
9* (e) = f*(m(e)r n - k } for all fc-tensors e on P. If the two vectors £ and tf are 
a basis of P 0i the effect of m on a basis tensor £*ij'r*~*~ J " of Pf k is given by 
m(eVr*-'- J ) := (n*±l/k!±l)?r)>r k - i -> . 
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Proposition 8.6. If F:P — ► Q is an n-ic polynomial function, the unique polyno- 
mial function G:P — ► Q of degree k that osculates F to kth order at the point r 
in P is given by the formula gf(e) ~ ff{m(e)r n ' k ), where m:P® k -+ P®* is the 
invertible flavor-preserving linear map determined as follows. Let {Si, . . . , 8 P } be a 
basis of Po. Then, the set of k-tensors of the form 6[ l • • • 6p p r k ~ % form a basis for 

P® fc , where i := 1 1 H H t p . The matrix of the map m with respect to this basis 

is diagonal, and the entries on the diagonal are determined by 

Note that the primary source of complexity in Prop. 8.6 was the annoying 
factors that arise when differentiating; indeed, the sole effect of the map m is to 
undo the damage caused by those factors. It is often more convenient to wash out 
the effects of the annoying factors by thinking about osculating flats (that is, affine 
subspaces) instead of osculating curves or surfaces. Define Osc* F(r), the flat that 
osculates F to ifeth order at r, to be the affine span of the range of G, where G is the 
curve or surface of degree k that osculates F to fcth order at r. For a nondegenerate 
curve F, the flats Osc* F(r) will have dimension k. In particular, the flat Osci F(f) 
is the tangent line to F at r, while the flat Osc2 F{f) is the osculating plane of F 
at f, the plane containing the point F(f) and spanned by the vectors P'(r) and 
F"(f). For a nondegenerate surface F, the flat Osci P(r) is a tangent plane, while 
the flat Osc2 F(r) is an affine 5-space containing the point F(r) and spanned by the 
five vectors dF r (£), dF T {ri), d 2 F T {£,t), d 2 F r (Z,ri), and rf 2 P r (f/,»?), where is 
a basis for P 0 . 

The relation between osculating flats and blossoms can be computed as fol- 
lows. If G is the fc-ic function that osculates F to Arth order at r, the definition 
of osculating flats says that Osc fc P(r) = Span(Range(G)). Prop. 4.7 tells us that 
Span(Range(G)) = Range(^®); thus, the flat Osc fc P(r) is precisely the locus of the 
points gf (e) as e ranges over all 1-flavored A;-tensors on P. From Prop. 8.6, we learn 
that gf(e) = f£(m(e)r n ~ k ). Since m is a flavor-preserving bijection, we conclude 
that Osc fc P(r) is also precisely the locus of ff(ev n " k ) as e varies over P®*. Notice 
how dealing with osculating flats, rather than curves or surfaces, washes out the 
effects of the annoying factors. 

Translating back into multiaffine blossoms, we conclude that the range of the 
fc-affine map 

(ui,...,u fc ) /(Ui,...,U fc ,]^^) 

n-k 

lies inside of OsCfcP(r), and also spans it. The "lying inside" part is particularly 
important. 

Proposition 8.7. If a point r in P occurs m times among the arguments of a 
value /(ill,. • - ,u») of the n-affine blossom f, then that blossom value lies in the 
flatOsc n _ m P(r). □ 

This concept of osculating flats leads to a very natural differential perspective 
on the meaning of the n arguments of a multiaffine blossom. For example, let 
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F: L — ► Q be a nondegenerate polynomial cubic curve, and hence a space curve; and 
let f , s, and i be distinct points in L. The blossom value / (r , S , i ) is the unique point 
that lies in the intersection of the three osculating planes Oscj P(f), Oscj and 
Osc2 F(i). The value /(s, «, f) is the intersection of the tangent line Osci F(s) with 
the osculating plane Osc 2 F(i). More generally, for any nondegenerate polynomial 
curve or surface P, we can determine the blossom value /(ui, . . . ,u n ) uniquely by 
intersecting the osculating flats corresponding to all of the distinct points r that 
occur in the argument bag {ui, . . . ,u n }. 

This differential intuition is so simple that one might be tempted to use it to 
define blossoms, instead of manipulating the coefficients of polynomials as we did 
in Section 2. Unfortunately, that idea runs into trouble if the polynomial curve or 
surface is degenerate, that is, if its Bezier points are not aflinely independent. For 
example, if F is a polynomial cubic curve that happens to lie in a plane (that is, 
has zero torsion), all of its osculating planes Osc 2 F(u) are coincident, and hence we 
can't use their intersections to define the blossom values /(r,s, i). 

Note that the osculating flats of surfaces interact with each other somewhat 
more subtly than do the osculating flats of curves. For example, let F(u) be a 
nondegenerate quadratic polynomial surface. Since F has six Bezier points, the 
affine span of the range of F is a 5-flat. The blossom value / (u, v) is the intersection 
of the two tangent planes Osci P(u) and Osci P(v). The subtle point is that we 
would expect two planes in a 5-flat to be skew; since these two planes intersect 
in the point /(u,v), they aren't in general position. As the degree of the surface 
goes up, these coincidences get worse. A nondegenerate cubic surface F lies in a 
9-flat. For each point u, the flat Osc 2 P(u) has dimension 5. Three 5-flats in general 
position in a 9-space don't come close to intersecting; but any three 5-flats of the 
form Osc 2 P(u), Osc 2 P(v), and Osc 2 P(w) do intersect, at the point /(u,v,w). 

Working with osculating flats instead of osculating functions is one way to wash 
out the effects of the annoying factors. But there is another important context where 
the annoying factors aren't a problem. In particular, if we compare the behaviors 
of two different functions of the same degree, then the annoying factors cancel out, 
because they are the same on both sides. 

Proposition 8.8. IfF:P-+Q and G:P->Q are two polynomial functions of the 
same degree n, then F and G agree to kth order at r in P if and only if the identity 

/• (ux . . . u k r n - k ) = g® (u x • . . u*r n -*) 

holds, that is, if and only if the blossoms f and g agree on all argument bags that 
include at least n — Ar copies of r. 

Proof Suppose first that the blossoms do agree as specified. We can rephrase this 
as the claim that f?(er n - k ) = gf(er n " k ) for all simple, 1-flavored it-tensors c on 
P. Since those tensors span all of P+ k , we deduce that the same equality holds 
for every A:- tensor e on P. Note that, in fact, it would have been enough to have 
assumed that /®(u*r n "*) = g®(u k r n - k ) for all points u in P, since the A:-tensors 
u* that are perfect A;th powers also span P? k y by Cor. 4.6. 
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Let £**P r (£i, . - . , &) be a value of the tth derivative of P at r, for t in [0, k]. By 
Prop. 8.4, we have ..,&) = ni/f (r n "^i • • • fc), and similarly for G. We 

can deduce that these derivatives agree by letting the fc-tensor e above assume the 

value e := £i • • • ZiT k ~ % - 

Conversely, suppose that F and G agree to kth order at r, and let . . . , S p } 
be a basis for P 0 . The k- tensors of the form c := S 3l •••5j.r /c " i for % in [0,fc] 
form a basis for P? k . Furthermore, by Prop. 8.4 again, we have /f(cr n " fc ) = 
(l/n*)<Pi'V($j l , . . . , 8j.) } and similarly for G. If the ith derivatives of F and G agree, 
then it follows that ff{cv n ~ k ) = 0®(cr n- *) for all c in the basis, hence for all c in 
P®*, and hence, in particular, for all c of the form c = u x • • • u*. □ 

Prop. 8.8 will be the starting point in Section 14 for our investigation of spline 
curves via the blossoming technology. 

9. The algebraic view of a blossom's n arguments 

Our third perspective on the meaning of a blossom's n arguments is a trivial 
observation with important consequences: the n arguments of a blossom value are 
the n factors of a simple n-tensor. This observation exploits the Tensoring Principle 
to give a clearer understanding of the geometric relationships between a polynomial 
function and its multiaffine blossom. Note that the range of the polynomial function 
F itself is the curve or surface that we started with, while the range of P's multiaffine 
blossom / is the set of points for which we have good labels. So far, / has provided 
us with labels for all of the important points, such as Bezier points and poles. But 
there are points that / doesn't label. 

The Tensoring Principle tells us that a polynomial map P: P Q is equivalent 
to an affine map /• : P® n — ► Q, and an affine map is a pretty simple thing. Instead 
of trying to understand the geometry of P(u) = /®(u n ) and its multiaffine blossom 
/(ui,...,u n ) = in the object space Q, it is a better idea to study 

the geometry of the corresponding argument n-tensors u n and Ui • • • u n bade in the 
tensor power space P® n . The geometry in P® n is independent of P. Furthermore, 
since the map /® is affine, it can affect that geometry only in two fairly simple 
ways. First, the map /® may fail to be surjective (onto). Prop 4.7 tells us that the 
range of /® is precisely the flat in Q that forms the affine span of the range of P, or 
equivalently, the affine span of the Bezier points of P with respect to some reference 
simplex in P. This range flat /®(P® n ) might be smaller than Q. But however big 
it is, it contains all of the points of interest for the study of P. By studying the 
geometry of P® n , we can avoid being distracted by the irrelevant points in Q that 
don't lie in /®(P® n ). Second, the map /® may fail to be injective (one-to-one). 
This happens precisely when the Bezier points of P are not affinely independent, 
causing the map /® to collapse its domain P® n down to a range flat /®(P® n ) of 
smaller dimension. This type of degeneracy makes it particularly attractive to study 
the geometry back in P® n rather than in Q, where things have collapsed on top of 
each other. 

To study the geometry of P® n , we shall invoke Prop. 5.6. Let {b 0 , . - . ,& p } be 
a linear basis for the linearization P* of P, and let fi be the associated normal- 
form map. Prop. 5.6 tells us that 0 is an algebra isomorphism between X[P] and 
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R[6o, . . . , bp]. Thus, questions about the structure of the space P® n = JC"[P] can 
be translated, using 0 y into questions about 1-flavored n-homogeneous polynomials 
in R[6o, . . . » bp]. The question that interests us at the moment is whether or not a 
tensor e in P® n can be written as a product of n points e = iii • • -u n , since those 
are the tensors that correspond under /® to points that have multiaffine labels. 

If e is a tensor in P® n , the normal-form polynomial /3(e) is 1-flavored and 
n-homogeneous. In order for a factorization e = Ui...u n to exist, the poly- 
nomial 0(e) must factor as the product 0(e) = 0(vii) •••£(u n ) of n 1-flavored 
1-homogeneous polynomials 0(xii). In fact, it is enough that 0(e) split in any way 
into n linear factors. It is a standard algebraic result that any factor of a homo- 
geneous polynomial must be homogeneous, so the homogeneity of the factors is 
guaranteed. As for their flavor, since the product is 1-flavored, we can always push 
scalars around so that each factor is also 1-flavored. Thus, an n-tensor e on P can 
be written as a product of n points if and only if its normal-form polynomial 0(e) 
splits into linear factors. 

This notion of "splits into linear factors" is important enough that it has a 
name. An n-tensor e on P (of any flavor) is called simple if it can be factored into a 
product of 1-tensors; otherwise, it is called compound. (The term "decomposable" 
is sometimes used instead of "simple.") We started out building the tensor power 
space P® n by multiplying points together; the tensors that result directly from this 
multiplication are the simple ones. The compound ones are the ones that have to 
be thrown in as well in order to make P® n into an affine space. 

For example, consider the 2-flavored 2-tensor e on the line L given by e := 
0 2 + 2 12, and use the Cartesian basis {0,5} for L*. Converting to normal form, we 
have 

/3(e) = 0 2 + (0 + 26) (0+125) 
= 20 2 + U06 + 246 2 
= 2(0 + 35)(0 + 4$). 

Since 0(e) splits, the tensor e is simple; in fact, e = 2 34. For an example in the 
other direction, note that the tensor c = 0 2 + S 2 is already in normal form. Since 
(3(c) = c does not split (over the real numbers), the tensor c is compound. 

In the rest of this section, we shall try to develop some intuition for the geometry 
of the set of simple tensors sitting in P® n for small values of n and p := dim(P). 
We begin by reviewing the case of quadratic curves, where n — 2 and p = 1. 

The second tensor power L® 2 of the affine line L consists of homogeneous 
quadratic polynomials in two variables with one flavor constraint. If we use the 
Cartesian basis {0,5} for L*, a tensor e in L® 2 has the normal form 0(e) = 0 2 + 
bOS + cS 2 for some constants b and c, where the flavor constraint is embodied in the 
fact that the coefficient of 0 2 is 1. We can use the pair (6, c) as a Cartesian coordinate 
system on the plane L® 2 , The factoring properties of the tensor e depend upon the 
sign of the discriminant Disc^(e) = b 2 - 4c. (The subscript of 0 warns us that 
the value of the discriminant, although not the sign of that value, depends upon 
the chosen basis for L*.) If Disc^(e) = b 2 — Ac = 0, then e is a perfect square 
e = u 2 = (0 + uS) 2 = 0 2 + 2u0S + u 2 6 2 . Let D denote the set of such tensors 
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Fig. 9.1. An affine picture of the space L® 2 of 2-tensors on L 

( a D n for double root). Note that D forms a parabola D(u) := (2u,u 2 ) in the plane 
L® 2 , as shown in Fig. 9.1. If Disc^(e) > 0, then e has two distinct real roots: 
e = uv = 0 2 + (u + v)0S + uv8 2 for u ^ v. Let R denote the set of such tensors 
(u + v y uv) ("R" for real roots); R is the exterior of D. Finally, if Disc^(e) < 0, 
then e doesn't factor over the reals. Call the set of such tensors C (for compound 
tensors); C is the interior of D. 

If we enlarged our coefficient field from the real numbers to the complexes, 
then the tensors in C would also factor, with a pair of complex conjugate roots. 
For example, the 2-tensor (0, 1) = 0 2 + S 2 in C, which is compound over the reals, 
splits into (<5 + t$)(0 — iS) over the complexes. Indeed, every n-tensor on L for every 
n would then be simple, since the complex numbers are algebraically closed. But 
the real numbers are the natural coefficients to use in computer graphics, so we 
shall stick with them. Moving to the complexes would only eliminate the problem 
of compound tensors for curves, anyway. For surfaces, as we will see later on, most 
tensors are compound even over the complex numbers. 

One difficulty with our current picture of the space L® 2 is that some of the 
interesting things are happening out at infinity, where they are hard to see. With a 
little work, we can get a finite picture of all of the simple 2-tensors on L, that is, of 
RUD. Indeed, some readers may have already realized that, viewed projectively, the 
set R U D is a Mobius strip. We shall derive this fact in slow and careful algebraic 
detail in the next few paragraphs. The advantage of our plodding derivation is that 
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a closely analogous method will then suffice to compute a finite picture of the simple 
3-tensors on L — a picture that is less well-known. The first step on our plodding 
road is to invoke projective geometry to explain what it means for things to be 
happening "at infinity." 

Consider the linearization Lf 2 of £® 2 . This linearization is a 3-space consisting 
of the tensors (6,c;a) = a0 2 + bOS + cS 2 , where we are writing the flavor-coordinate a 
last out of habit. The basic idea of projective geometry is to treat the lines through 
the origin of Lf 2 as the "points" of a new space: a projective plane. Most of the lines 
through the origin of Lf 2 contain precisely one point of each flavor; in particular, 
the same line that contains the point (6,c;a) also contains (i/a,c/a;l) whenever 
a 0. Such lines model "finite points" in the projective plane. The other lines 
consist entirely of points that are Q-flavored. Such lines model "points at infinity." 
We can study things that happen everywhere in L® 2 , including "at infinity," by 
studying a neighborhood of the origin in the homogeneous linear space Lf . 

Carrying out this plan, we find that the factoring properties of a tensor e = 
(6,c;a) in the 3-space Lf 2 depend upon the homogenized form of the discriminant 
Disc^(e) := b 2 - 4ac. The set D of tensors with a double root is described by the 
equation Disc^(e) = 0, and forms a double cone with elliptical cross section, centered 
around the line a — c, 6 = 0. The set R of tensors with two distinct real roots has 
Disc^(c) > 0, and forms the exterior of this cone, while the set C of compound 
tensors has Disc^(e) < 0 and forms the interior of the cone. 

The second step on our plodding road involves using some type of projection to 
reduce this 3-dimensional picture down to a 2-dimensional one. Projection, in this 
context, merely means restricting our attention to some 2-dimensional surface in 
the 3-space Lf 2 that cuts all (or most) of the lines through the origin in one point 
(or a small number of points). The most obvious surface to project down onto is a 
plane. If we pick any plane A in Lf 2 not containing the origin, most lines through 
the origin will intersect A in a unique point. The lines that don't intersect A form a 
plane through the origin parallel to A. In projective language, those lines are viewed 
as "points at infinity," which, together, make up the "line at infinity." The problem 
with such projections is that every plane through the origin of Lf 2 intersects the 
exterior R of the cone, and hence includes some lines of simple tensors. Since we 
want to get a single, finite picture of all of the simple tensors, we can't afford to 
miss such a line. For example, if we picked the plane A := L® 2 , this projection 
would take us right back to Fig. 9.1, where we started, in which the entire "line at 
infinity" consists of simple tensors. 

Instead of projecting onto a plane, we shall project onto an infinite cylinder, as 
in the Mercator projection used in map-making. The advantage of this choice is that 
only one line through the origin escapes intersecting our chosen surface: the axis of 
the cylinder. We can choose things so that this line consists of compound tensors, 
which we don't mind ignoring. One disadvantage of using a cylinder is that each line 
through the origin that does intersect the cylinder does so twice, in an antipodal 
pair of points. In order to achieve our final picture, we will have to identify the two 
points in such an antipodal pair. 

To make the geometry of the cylindrical projection simpler, it is helpful to 
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Fig. 9.2. A projective picture of the space L® 2 of 2-tensors on L 

convert to a new Cartesian coordinate system for L% 2 . In particular, let x := 6, 
y ~ c - a, and z := c + a. In the (x,y,z) coordinate system, the discriminant has 
the simpler form Disc^(c) = x 2 + y 2 - z 2 . The cone Z> determined by the equation 
Disc^e) = 0 now has circular cross sections, an apex angle of 90 degrees, and is 
centered around the z-axis. If we imagine a copy of the earth centered at the origin 
of Lf 2 and with its poles on the z-axis, the cone D intersects that earth along the 
parallels of 45 degrees north and south latitude. 

We now project if 7 onto the cylinder of radius 1 about the z axis; that is, we 
restrict our attention to tensors (x,y,z) that satisfy x 2 + y 2 = 1. Fix an angle 0, 
let u := cos0 and v := sin0, and consider the set of tensors t$ — {(cos0,sinfl,z) | 
z € R}, which is a generating line of the cylinder. The line 1% corresponds to a 
particular meridian of longitude in the Mercator analogy. For tensors e in we 
have Disc(e) = 1 - z 2 . Hence, the line 1$ intersects D in the points <costf,sin0,±l) 
and intersects R in the points (cos0,sin0,z) for z in (-1,1). Fig. 9.2 shows the 
cylinder unrolled onto the (9,z) plane. Remember that each line through the origin 
of Lf 2 intersects this cylinder in two antipodal points: one with 6 in [0,jr) and 
the other with 0 in (tt,2t). Thus, the set of simple 2-tensors on L is, up to a 
diffeomorphism, the shaded region in Fig. 9.2 with its left and right edges identified 
as indicated by the arrows. That is, the simple 2-tensors RuD on L form a Mobius 
strip with the perfect squares D as its single edge. 

To understand cubic curves, we must study the geometry of L® 3 . Continuing 
with the Cartesian basis {0,5} for L*, a generic tensor e in L® 3 has the normal- 
form polynomial 0(e) = 0 3 + b0 7 8 + cQ6* + d6 s , and the triple c = (6,c,rf) forms 
a Cartesian coordinate system on I® 3 . From a factorization point of view, the 
tensors e in L® 3 can be partitioned into four classes: T, the tensors with one triple 



9.3 



THE ALGEBRAIC VIEW 



51 



real root; D y the tensors with one double real root and one single real root; R y 
the tensors with three distinct real roots; and C, the compound tensors. (Viewed 
over the complexes, the tensors in C have one real root and one pair of conjugate 
complex roots.) From the classical theory of cubic polynomials, we recall that the 
discriminant of the polynomial /3(e) is given by 

Disc^(e) := 6 V - 4c 3 - 4b s d - 27 d 2 + 186cd. 

The set R contains the tensors whose discriminant is strictly positive; the set C 
contains the tensors whose discriminant is strictly negative; and all of the tensors in 
DuT have zero discriminant. Note that DUT constitutes the boundary between 
R and C, since the discriminant varies smoothly. The set T, which forms part of 
this boundary, consists of the perfect cubes T(u) — u 3 = (3u,3u 2 ,u 3 ), and is hence 
a twisted cubic curve lying in the 3-space X® 3 . 

Fig. 9.3 is the analog, in the cubic case, of the parabolic picture in Fig. 9.1. Of 
course, the space L® 3 has three dimensions instead of two. Instead of attempting to 
draw a perspective view, Fig. 9.3 shows four different cross sections, corresponding 
to the planes 6 = -3, b — 0, b = 3, and 6 = 6. If we put the c axis vertically, as done 
in Fig. 9.3, each cross section of the form 6 = 3A looks rather like an ocean wave, 
with R as the water, C as the air, D as the water's surface, and T as the cusp of the 
wave. To plot any cross section of L® 3 , it is enough to compute the intersection of 
the cutting plane with the boundary D U T, or equivalently, to compute all tensors 
of the form u 2 v that lie in the cutting plane. In Cartesian coordinates, such a tensor 
is given by: e - u 2 v - (0 + uS) 2 (0 + vS) = (2u + v,u 2 + 2uv,u 2 v). Cutting by a 
plane of the form b = 3A for some real constant A is particularly simple, since the 
relation b = 2u + v = 3A gives us a linear relation between u and v, which allows us 
to express both c and d as cubic polynomials in either u or v. That is, the curve in 
which the boundary surface DuT meets the plane 6 = 3A is a polynomial cubic. In 
fact, it is always an affine image of a semi-cubical parabola, with the cusp located 
on T at the tensor A 3 . 

Fig. 9.3 also illustrates another interesting point. A tensor of the form e — a 2 i 
in D UT must lie on the tangent line Osc! T(s) to the cubic curve 7*(ti) at u = s, as 
we can see by applying Prop. 8.7. Therefore, the boundary surface DuT is actually 
a ruled surface, with the tangent lines to the curve T(u) as its generators. The four 
circled points in Fig. 9.3 indicate the intersections of the tangent line Osci T(l) with 
the four drawn cross sections. 

This first picture of the geometry of the simple tensors surrounding a cubic 
curve has the same problem that our first picture had in the quadratic case: lots 
of the interesting behavior is out at infinity. By following in the footsteps of E. C. 
Zeeman, we can obtain a finite picture of all of the simple 3-tensors on L, that is, 
an analog to the Mobius strip of Fig. 9.2 for cubics. Zeeman invented this finite 
picture for use in catastrophe theory, and he christened it the umbilic bracelet [39]. 

The first step is to projectivize. Instead of considering the affine 3-space L® 3 , 
we consider lines through the origin of the linear 4-space Lf s . The factorization 
properties of a 3-tensor c = <6,c,d; a) = a0 3 + b6 2 S + cOS 2 + dS z are determined by 
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the homogenized form of the cubic discriminant: 

Dis C/9 (e) := b 2 c 2 - 4ac 3 - 4b s d - 27a 2 d 2 + I8abcd. 

The next step to take isn't so clear, however. We shall simply follow Zeeman, 
treating his ideas like rabbits pulled from hats. Zeeman J s own presentation of the 
umbilic bracelet in [39] is better motivated, at the price of invoking the machinery 
of group actions. 

Zeeman first changes to a new (ti,v,x, y) Cartesian coordinate system for Lf z 
by the following rules: u := (a - c)/4, v := (d - 6)/4, x := (3a + c)/4, and y := 
(-3d - 6)/4. A little algebra reveals that, in terms of these new coordinates, the 
discriminant Disc^(e) is given by 

4(27(u 2 + v 2 ) 2 -18(u 2 + ^ 

Note that, if u = v = 0, then Disc /? (e) = (-4)(x 2 + y 2 ) 2 . Thus, the lines through 
the origin of Lf s that lie in the plane u = v = 0 all consist of tensors with negative 
discriminant, which are hence compound. Since we are searching for a finite picture 
of the simple tensors, we can safely ignore the plane u = v = 0. Every tensor 
(u,v,x,y) that does not satisfy u = v — 0 has two scalar multiplies that satisfy 
u 2 + v 2 = 1. We shall project Lf z onto the set of tensors satisfying u 2 + v 2 = 1, 
which is a solid torus: the Cartesian product of the unit circle in the (u,v) plane 
with the entire (x, y) plane. 

Fix an angle 0, let u := cos0 and t; ~ sinl, and consider the factorization 
structure of the 3-tensors in the plane A 9 = {{cos0,sm0,x,y) | x,y G R}. For 
tensors e in A$> we have 

Disc^(c) = 4(27-18(x 2 +y 2 )-(x 2 + y 2 ) 2 + (8x 3 -24xy 2 )costf+(24x 2 y-8y 3 )sintf). 

The equation Disc^(e) = 0 is a quartic equation in x and y whose solution is some 
algebraic plane curve of degree 4. In fact, that curve turns out to be a hypocycloid 
with three cusps. In words, mark a point on a circle of radius 1; roll that circle 
around the inside of a circle of radius 3 centered at the origin of A$; and then rotate 
the hypocycloid traced by the marked point so that one of its cusps makes an angle 
of 0/3 with the x-axis. The result is the curve Disc^(e) = 0. To verify this claim 
without drowning in a sea of trigonometric identities, it is helpful to introduce a 
complex variable z :— z + iy and to view the plane An as a copy of the complex 
numbers. The hypocycloid in question is then given parametrically by the formula 

z:=e i$ ^(2e^ + e- 2i ^) (9.4) 

for (p in [0, 27r). The equations u + iv = e i$ and x + iy — z allow us to express the 
discriminant of e in A $ as follows: Disc^e) := 43fc(27 - 18zz - (zz) 2 + 8z 3 e"*'), 
where 5R denotes the real part and z denotes the complex conjugate of z. If we 
plug in the z from Eq. (9.4), it is straightforward algebra to check that Disc^(c) = 
4R(8(e*> - e- 3 **) - 4(e w * - e" 6 ^)) = 0. This verifies that all tensors on the 
hypocycloid have zero discriminant. To see that no other tensors in the plane A$ 
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Fig. 9.5. The plane A $ 



could also have zero discriminant, consider rotating a line in the plane so that 
it remains tangent to the hypocycloid. Counting multiplicities, this rotating line 
always intersects the hypocycloid a total of four times. In general, the point of 
tangency is a double intersection and the two crossings of the other two sides are 
each single intersections. When the point of tangency coincides with a cusp, that 
intersection has multiplicity 3 and the midpoint of the opposite side provides the 
fourth. Since the discriminant is a polynomial of degree 4, the curve that it implicitly 
defines can't intersect any line more than 4 times; therefore, the hypocycloid must 
account for all of the solutions of Disc^(e) = 0 in the plane 

We can now deduce the factorization properties of all of the tensors in the 
plane as shown in Fig. 9.5. Since the origin (cos 0, sin 0,0,0) has discriminant 
4 • 27 = 108, which is positive, all the tensors inside the hypocycloid belong to i?, 
while the ones outside belong to C The tensors on the hypocycloid itself belong to 
D U T. Comparing with Fig. 9.3, it is geometrically clear that the three cusps are 
in T while the rest of the hypocycloid is in D. To verify this algebraically, we first 
verify by trigonometry that the tensor 4(cos(r/3)0 — sin(r/3)5) , when expressed 
in (u,v,x,y) coordinates, is given by (cosr,sinr,3cos(r/3),3sin(r/3)). Since the 
three cusps in A$ correspond to the three values t = 9 j t = 9 + 2it j and r = 9 + 4^, 
we conclude that the three cusps are perfect cubes, and hence lie in T. No other 
points of the hypocycloid can lie in T because the cubic curve T can intersect the 
plane A$ in at most three points. 

To achieve the finite picture of R U D U T that we have been searching for, it 
only remains to assemble the planes A$ for different 0. Fig. 9.6 shows a picture of 
the relevant portion of (0,x, y) space. Just as in the quadratic case, we restrict 9 
to the interval [0, n) in order to get only one point from each antipodal pair. As 9 
varies over this interval, the hypocycloid rotates one sixth of a turn. The plane Aq 
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Fig. 9.6. A projective picture of the space L® 8 



is identified with A T as indicated by the letters. Note that this identification does 
not destroy orientability in the cubic case, although it did in the quadratic case; the 
reason is that negating all four axes of Lf 3 is orientation preserving, while negating 
all three axes of Lf 2 is not. If we didn't care at all about compound tensors, we 
could bend the screw of simple tensors in Fig. 9.6 around, giving it an extra half- 
twist so as to bring the two identified faces Ao and A w together. If we choose the 
correct sign for the extra half-twist, the resulting solid is the umbilic bracelet shown 
in Fig. 9.7. 

Remark 9.8. In catastrophe theory, it is important not to identify two cubics 
that differ in sign, since this would confuse maxima and minima. Hence, instead of 
dealing with lines through the origin of Lf 3 , Zeeman deals with rays leaving the 
origin. He allows 0 in Fig. 9.6 to vary over [0,2tt) instead of just [0,ar) and he then 
identifies Ao with A** without any extra twisting. The resulting bracelet looks the 
same either way, however. Recent work by Jorge Stolfi suggests that computational 
geometry may be another area where it is better to keep the antipodal rays as 
separate objects, rather than to identify them, because this enables the construction 
of an oriented version of projective geometry [38]. 

Surfaces work out differently from curves in two respects. First, there are so 
many dimensions involved for even a quadratic surface that we won't waste any 
effort trying to visualize the simple tensors geometrically; instead, we will stick to 
algebra. The second difference also involves dimensionality. When studying curves, 
we found that the simple 1-flavored n-tensors on L formed an n-dimensional subset 
of the n-dimensional space L® n . Thus, not all tensors were simple, but there were as 
many dimensions of simple tensors as there were of tensors in general. For surfaces, 
that is no longer true. All of the 1-flavored n-tensors on a plane P form the space 
P® n , which has dimension ( n + 2 ) - 1 = n(n + 3)/2, by Prop. 4.9. But each simple 
1-flavored n- tensor can be written as the product e = Ui • -*u n of n points in P; 
hence, there can't be more than 2n dimensions' worth of them. This discrepancy in 
dimension corresponds to the fact that homogeneous polynomials in three variables 
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Fig. 9.7. The umbilic bracelet, as carved in maple by Tim Poston 

generally don't split into linear factors, even over the complex numbers. 

For example, consider quadratic surfaces. The study of quadratic surfaces is the 
study of P® 2 , where P is an affine plane. Prop. 4.9 tells us that P® 2 has dimension 
five. If we pick a basis {61,62,63} for P*, the normal form f3(e) of a tensor e in 
Pf 2 will be a homogeneous quadratic polynomial in the three variables 6;. It is 
technically convenient to pick a Cartesian basis, say {o,£,r?}, where o is a point in 
P and £ and r\ are vectors on P. With respect to this basis, we have 

P(e) = uo 2 + v£ 2 + wti 2 + x^V + yor) + zo£, 

where u = Flav(c). Thus, the 6-tuple (v,w,x,y,z;u) forms a Cartesian coordinate 
system on P,? 2 , and P® 2 is the subspace u = 1. A tensor e in P$ 2 is simple if 
it factors into, say, c = (a x o + 6i£ + c x r))(a 2 o + 6 2 £ + c 2 r?). Though there are 
six coefficients in this factored form, there are only five degrees of freedom in the 
product, since multiplying the first factor by A and the second by 1/A doesn't affect 
the result. Thus, the six coefficients of a general 2-tensor c must satisfy some identity 
in order for there to be any hope that e splits, even over the complex numbers. That 
identity turns out to be 

ux 2 + vy 2 + wz 2 - 4uvw - xyz = 0, (9.9) 
as the reader can easily verify. 



9.10 



THE ALGEBRAIC VIEW 



57 



Challenge 9.10, A general 3-tensor e on P has the normal form 

go 3 + ro 2 £ + 50 2 f/ + to£ 2 + uo£ri + vor? 2 + w£ 2 + x£ 2 t? + y^ 2 + zrj 3 , 

with 10 coefficients, of which the first is the flavor q = Flav(e). The simple 3-tensors 

(aio + bit + cir/)(a 2 o + & 2 £ + c 2 t7)(a 3 o + 6 3 £ + c 3 »j) 

form a 7-dimensional subset of the space P? 3 ; there are 9 coefficients, but 2 of the 
degrees of freedom just move scalars around among the three factors. This implies 
that there are 3 dimensions' worth of constraints that must hold on the coefficients 
q through z in order for the tensor e to have any hope of splitting into linear factors, 
over any coefficient field. Find a basis for the ideal of polynomials in R[?, . . . , z] 
that are zero on all simple 3-tensors. (Resultants or Grobner bases might help.) 



Part D: Adjusting the Degree 

In most of our work so far, we have taken the degree n of our curves and 
surfaces to be a manifest constant; the exception was our study of osculating curves 
and surfaces in Section 8. There are various situations where it is useful to adjust 
the degree, however. Raising the degree means viewing a function F of degree n as a 
degenerate example of a function of some degree m > n. Lowering the degree means 
approximating a function of degree n by a function of degree m for some m < n. 
After we introduce bipolynomial surfaces, we will also study degree splitting and 
degree joining. 

10. Degree raising 

A parabola F is a polynomial curve of degree two, so it has a biaffine blossom 
f(u,v). But, if we like, we can also consider F(u) to be a degenerate example of a 
cubic polynomial curve G(u) = F(u). The cubic G has a triaffine blossom g(u y v, u>). 
We know that F{u) = /(u, u) = g{u y u, u), of course; but it is interesting to ask what 
other relationships hold between the two blossoms / and g. 

It turns out to be easy to express g in terms of /: we have 



triaffine function whose diagonal agrees with F = G. For an arbitrary degree n and 
an arbitrary parameter space P, we have 



where the hat over the argument indicates that that argument is omitted. 

Both of the functions / and g that appear in Eq. (10.1) are multiaffine blossoms 
of F. Similarly, F also has two different homogeneous forms and G* , two different 
affine blossoms /• and g®, two different multilinear blossoms /* and g*> and two 
different linear blossoms ff and gf. In each case, one results from viewing F as 
having degree n, while the other results from viewing F as a degenerate function of 
degree n + 1. To distinguish between the various blossoms of the single function F , 
when confusion might arise, we shall prefix the word "blossom" with the relevant 
degree. Thus, we shall call / the multiaffine n-blossom of F, while g is the multiaffine 
(n+ 1) -blossom, and so forth. Note that, of the six guises of a polynomial map shown 
in Fig. 5.1, the polynomial map itself, in the upper-left corner, is the only one where 
there is any chance of confusion about the intended degree. 

To translate Eq. (10.1) into the world of tensors, we need to introduce an 
operator D on tensors that might be called total tensor differentiation. To begin 
with, let D:X[P] X[P] be the linear operator that takes an X-polynomial $ on 
P into the polynomial 
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For example, we have 

D (M) = ^ + ffi>= 201 + 0'. 

If $ and * in X[P] are affinely equivalent, then D($) and D(V) will also be equiv- 
alent; hence, the operator D is also well-defined as a linear map D: X[P] — ► X[P] 
on tensors. In addition to being linear, the map D has the property D(ab) = 
aD(b) + D(a)b] such maps are called derivations. In fact, D is the unique derivation 
of the tensor algebra that maps each 1-tensor into its flavor: for e in X* [P], we have 
D(e) — Flav(e). More generally, if e is a Ar-tensor, then Flav(D(e)) = ArFlav(c). 

Proposition 10.2. Let F:P -> Q be an n-ic polynomial function, let ff be the 
linear n-blossom of F, and let gf be the linear (n + l)-blossom of F, that is, the 
linear blossom of the map G:P — ► Q that results from viewing F as a degenerate 
instance of a function of degree n + 1. The functions ff and gf are related by 
the identity gf(e) = f?(D(e)/(n+ 1)), where the operator D denotes total tensor 
differentiation. 

Proof The map c i— ► D{e)/{n + 1) is a flavor-preserving linear map from P®^ n "*" ' 
to Pf n that takes u n+1 ^ u n . Thus, the composition e h-+ ff (D(e)/(n+ 1)) is 

a flavor-preserving linear map from P® (fH " i) to Q* that takes u n+1 ^ P(u). By 
Cor. 4.6, the only such map is gf , so we must have gf(e) = ff(D(e)/(n+ 1)) for 
all e. □ 

Prop. 10.2 allows us to answer many questions in a straightforward way. For 
example, if F is a parabola, what is a formula in terms of the 2-blossom / of F 
for the point ^(0^1,5), where g is the 3-blossom of F? We compute as follows: 
0(0,1,2) = </® (012) = /®(Z?(012)/3) = /®((5I + 02 + 12)/3) = (/(0,1) + /(0,5) + 
/(I,5))/3. In some contexts, it might be better to have a formula for $(0IS) as 
an affine combination of the Bezier points of F with respect to some reference 
interval, perhaps [0,1]. If so, we can apply the normal-form operator /3 associated 
with the corresponding barycentric basis for L*, the basis {0, 1} in this case. Since 
£(5) = 21-0, we find that 

3 

In other contexts, we might want a formula that labels the point $(0, 1, 5) as a single 
value of the 2-blossom /. That is a trickier question, since such a formula will exist 
only if the 2-tensor Z?(0I5) is simple. We shall return to that question later on. 

Prop. 10.2 also gives us an easy proof of the standard degree-raising formula 
for Bezier curves. Suppose that we would like to compute, from the Bezier points 
of an n-ic curve segment P([5,f]), the Bezier points of that same curve segment 
viewed as a degenerate curve of degree n + 1. If / is the n-blossom of F and g is 
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Fig. 10.3. Raising the degree of a cubic curve 
the (n + l)-blossom, we have, for 0 < i < n + 1, 

g^iH** 1 -*) = /®(D(l*f n+1 - < )/(n + 1)) 

= f ® ^ i*'- 1 *"* 1 -* + (n + 1 - t)9 i i»- i ^ 

= -4r/®(s i - 1 i"- (< - 1) ) -4-r) /®(s*t n --). 

n + 1 \ n+1/ 

Fig. 10.3 shows an example in which a cubic curve is viewed as a degenerate quartic. 

We can also use Prop. 10.2 in the reverse direction, to figure out a formula 
in terms of the (n + l)-blossom g for a point that we already know in terms of 
the n-blossom /. For example, in the case n = 2, suppose that we would like a 
formula in terms of the 3-blossom g for the point /(0, 1). We should expect to make 
some arbitrary choices in the process of producing such a formula, because of the 
degeneracy of g. From Prop. 10.2, we are searching for a 3-tensor c on L with the 
property that D{e)/Z = 01. We can find a particular solution to this inhomogeneous 
differential equation by a little experimentation: the 3-tensor e := (30 2 1 - 0 3 )/2 = 
0 2 L5 happens to work. To get an arbitrary solution of the differential equation, we 
can add to e any solution c of the homogeneous equation D(c) = 0. To determine 
the solutions of the homogeneous equation, recall that the curve G is a degenerate 
cubic, with the degeneracy that G^(u) = 0. By Prop. 8.2, this is equivalent to 
saying that gf (S z ) = 0. Since gf(e) = ff(D{e)/Z) and F is unconstrained, it 
had better be the case that D(S S ) = 0; and, indeed, that is easy to verify. In 
fact, the kernel of the linear map D is precisely the set of scalar multiples of £ 3 . 
Thus, we have /(0,1) = y®(0 3 L5 + wS z ) for any scalar w. The value w = 0 
gives /(0,I) = 0(0,0, L5), while the value w — -1/2 gives the symmetric formula 
/(0,l) = 5 (=O5,l,l). 

This argument involved a particular case of an important general pattern: a 
derivative constraint on a polynomial curve or surface F:P — ► Q corresponds to a 
linear subspace of the nth tensor power space P® n on which the linear blossom of 
F is required to be zero. As a second example of this general pattern, note that a 
cubic polynomial surface has ten Bezier points while a quadratic surface has only six. 
What are the four constraints on the Bezier points of a cubic surface that, if they 
are satisfied, imply that the surface is actually quadratic? Let G: P — ► Q denote the 
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surface, and let Arst be a reference triangle in P. The surface G will be quadratic as 
well as cubic if its third derivative vanishes identically, that is, if d s G({ u £ 2 , £ 3 ) = 0. 
To interpret this identity as constraints on the Bezier points of (7, let a := r — t, 
P := s-r, and 7 := t — s be the three sides of the reference triangle viewed as vectors 
on P. Any two of the three vectors a, and 7 form a basis for the space P 0 of 
vectors on P. If we pick, for example, a and /?, we can conclude that the constraints 
on the Bezier points of G are given by gf{a s ) = gf(a 2 /3) = gf (a0 2 ) = gf (0 3 ) = 0. 
The constraint gf (a 3 ) corresponds to the demand that the G([t,rj) edge curve of 
the triangular surface patch G(Arst) be a quadratic curve, and similarly for the 
constraint gf(p z ) = 0 and the <7([r,sj) edge curve. The bad thing about these four 
constraints as a whole is a lack of symmetry; while the four tensors a 3 , a 2 £, a/3 2 
and fi z do form a basis of the constraint space, they aren't a symmetric basis. A 
little linear algebra reveals that one symmetric basis consists of the three simple 
tensors a 3 , £ 3 , and 7 s , and the compound tensor 

e = 2r 3 + 2s 3 + 2t 3 - 3r 2 s - 3r 2 t - 3s 2 t - 3s 2 r - 3t 2 r - 3t 2 s + 12rst. 

The structure of e is easier to understand if we arrange its coefficients in a triangular 
array: 

+2r 3 
-3r 2 s - 3r 2 t 
-3rs 2 + 12rst - 3rt 2 
+2s s - 3s 2 t - 3st 2 + 2t s 

The fact that gf (e) must be zero in order for G to be a quadratic gives us an affine 
relationship on the ten Bezier points of G: if we let V := (j(r,r,r) + y(s,s,s) + 
flf(t,t,t))/3 denote the centroid of the three vertex Bezier points, C := $(r,s,t) 
denote the central Bezier point, and O denote the centroid of the remaining six 
Bezier points, the relation gf (e)/6 = 0 becomes 2C - 30 + V = 0, or, equivalently, 
O is located two- thirds of the way from V towards C. That constraint, together 
with the demands that the three edge curves of the patch G(Arst) be quadratic 
curves, is enough to guarantee that G will be a quadratic surface. 

Let us return now to the question that we postponed earlier. Let F be a 
parabola with the 2-blossom / and the 3-blossom g. Is the point $((5,1,2) in the 
range of /, or not? As we learned in Section 9, this depends upon whether the 
tensor Z>(012) is simple or compound. Using the Cartesian basis {0,$} for L*, we 
have 

D(0B)/S = 5I + 6 3 2 + i2 = ™ 3 + ™* + ™\ 
Using the Quadratic Formula, we find that this 2-tensor does factor: 

D(0I2)= ^0+^1+ S^j ^0+^1--^ S^j «0423L577. 

This time, we were lucky: we were able to factor the 2-tensor D(e) over the reals. 
The following proposition reveals that, in fact, there was no luck involved. 
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Proposition 10.4. Let D:L® (n+1 ^ Lf n denote the total differentiation opera- 
tor for tensors on an affine line L, where the coefficient field is the real numbers R. 
For all n, if a is a simple (n + l)-fcensor, then D(a) will be a simple n-tensor. But 
the converse fails in a strong way: for n > 4, there exist simple n-tensors b with the 
property that every tensor a satisfying D(a) — b is compound. 

Proof. (The result in Prop. 10.4 is quite sensitive to the coefficient field; as we 
observed in Section 9, all n-tensors on the line L are simple if we adopt the complex 
numbers as our coefficient field.) 

Let o be a simple (n + l)-tensor on L, and let's ignore the trivial case a = 0; 
we want to show that D(a) is a simple n-tensor. If a is not 0-flavored, then a can 
be written in the form a = wu\ • • • u n +i, as a scalar multiple of a product of points 
&i in P. If a is 0-flavored, however, then some of its factors are vectors rather than 
points. Let / be the largest integer such that S l divides a. Then, we have 

i<»<* 

for some real constants u* and nonzero scalar where k :— n+1— /. Since D(S) = 0, 
we have, in terms of these constants, 

D(a) = w6 n + x - k £ JJ (0 + u 3 8). 

l<i<k l<j<n 

We can make things look more familiar by performing the change of variables 0 := Y 
and 6 := -1. This change of variables carries the tensor a into the monic univariate 
polynomial A(Y) of degree k in the variable Y given by 

A(Y) := {-) n + x - k w [J (y - m). 

l<i<k 

Note that the numbers u» are precisely the roots of A(Y). Furthermore, the same 
change of variables carries D(a) into A'(Y). Thus, we are left with the following 
problem: if a polynomial A(Y) of degree k has all real roots, does its derivative 
A'{Y) also have all real roots, or might some of the roots of A'(Y) be complex? By 
Ro lie's Theorem, the function A'(Y) must have at least one real root in the interval 
between any two distinct roots of A(Y). If the k roots of A(Y) are all distinct, this 
is enough to conclude that A'(Y) has a full set of A; - 1 real roots. A polynomial 
A(Y) with multiple roots is the limit of a sequence of polynomials with clusters 
of closely-spaced distinct roots. Therefore, its derivative A'(Y) is the limit of a 
sequence of polynomials with all real roots, and hence A'(Y) itself must have all 
real roots. We conclude that D(a) is simple whenever a is simple. 

We can use the same change of variables to investigate the converse question. 
Let 6 be a nonzero simple n-tensor on L\ in fact, we shall choose b to be 1-flavored, 
so that fc = tii •••u n = rL(8 ~ tt «^)' Aft er the change of variables 0 := Y and 
S :— — 1, we are left with the monic univariate polynomial B(Y) whose roots are 
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Fig. 10.5. The graph of the function y = f*(Y + 2)(Y + 1)(Y - 1)(F - 2) dY 

the u*. An (n + l)-tensor a will satisfy D(a) = b if and only if the polynomial 
A(Y) that results from the change of variables satisfies A'{Y) = B(Y), that is, 
if and only if A(Y) is an integral of B(Y). If fB(Y) represents one integral of 
B(Y), the other integrals all differ by some constant: A(Y) = f B(Y) + C for some 
constant C of integration. Thus, we are left with the following problem: do there 
exist polynomials B(Y) with all real roots but with the property that none of their 
integrals have all real roots? Such polynomials do indeed exist for n > 4; one quartic 
example is B(Y) = (Y + 2)(Y + l)(Y - 1)(Y - 2), whose integral / B(Y) is shown in 
Fig. 10.5. No matter how we shift the z-axis up or down, we can't make it intersect 
fB(Y) more than three times. Translating this example back into the blossoming 
world, we deduce that the value /(^2,^T, 1,2) of the multiaffine 4-blossom / of a 
nondegenerate quartic F can't be labeled as a value g(ui, . . . , us) of the multiaffine 
5-blossom g of F. □ 

We can arrive at a more interesting quartic example by letting the pairs of 
roots {1,2} and {—1, —2} coalesce to form two double roots, located, say, at 8 and 
at t. The resulting polynomial, B(Y) = (Y - s) 2 (Y - 1) 2 also has the property that 
no vertical shift of its integral f B(Y) has more than three real roots. Hence, the 
point /($, 0, t, t), which is the middle of the five Bezier points of the quartic segment 
F([8, i]) y is one of the points that has no label of the form g(iii, . . . , U5). 

Raising the degree of a polynomial curve gives us one more dimension's worth 
of labels, but Prop. 10.4 shows that the set of points for which we have at least 
one label never grows as a result of degree raising and, in fact, generally shrinks. 
Surfaces work out rather differently: there is no inclusion relationship in either 
direction, in general, between the points that can be labeled before the degree is 
raised and those that can be labeled after. Furthermore, this result continues to 
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hold even if we adopt the complex numbers as the coefficient field. 

Our first example will show that, by raising the degree of a polynomial surface, 
it may be possible to label points that couldn't be labeled formerly. Let F:P — ► Q 
be a nondegenerate quadratic surface (lying in some 5-flat), let / be its 2- blossom, 
let g be its 3- blossom, and consider the central Bezier point <jf(r,s,t), where Arst is 
a reference triangle for P. By Prop. 10.2, we have ^(r,s,t) = /®((rs + rt + st)/3). 
In order for this point to lie in the range of /, we would have to be able to factor 
the polynomial 

rs + rt + st 
3 

into two linear factors. This is not possible, as we can determine simply by plugging 
the coefficients of this polynomial into the discriminant-like polynomial in Eq. (9.9). 

Our second example will show that raising the degree of a polynomial surface 
may also result in having no labels for a point that formerly did have a label. Let 
F: P — ► Q be a nondegenerate cubic polynomial surface, let / be its 3-blossom, let 
g be its 4- blossom, and consider the point /(r,s,t). We shall prove that the point 
/(r,s,t) is not in the range of g. To prove this, we must consider all 4- tensors e on 
P that satisfy D(e) — 4rst and show that none of them is simple. The proof is not 
too difficult, but it takes awhile. 

Before we begin the proof in earnest, let us count the degrees of freedom to 
check the plausibility of the result. From Prop. 4.9, we know that the tensor power 
space P® 4 has dimension (£) - 1 = 14. In this 14-dimensional affine space, the simple 
tensors e form a subset of dimension 8, since there are two degrees of freedom in 
each of the four points that are factors of c = U1U2U3U4. The set of 4- tensors c that 
satisfy D(e) = 4rst is an affine subspace of dimension 5, since a 4- tensor a satisfies 
D(a) = 0 if and only if a is a linear combination of the five tensors £ 4 , £ 3 rj, . . . , r/ 4 , 
where ( and t} are a basis for P<>. In general, we wouldn't expect a 5-flat to intersect 
a subset of dimension 8 if they are both lying in a space of dimension 14. Thus, it 
is plausible that these two sets of tensors are actually disjoint, as we claimed above. 

The straightforward way to prove disjointness would begin by characterizing the 
set of simple 4-tensors on P algebraically, giving 6 dimensions' worth of constraints 
on their coordinates. Unfortunately, I don't know how to do that in practice even 
for simple 3-tensors on P, much less simple 4-tensors; see Challenge 9.10. Since the 
straightforward attack is not available, we shall resort to an ad hoc method that 
Andrew Odlyzko suggested to me. 

Let e be a 4-tensor that satisfies D(e) = 4rst; we want to show that e cannot be 
simple. As a first step, it is convenient to change from the barycentric {r,s,t} basis 
for P* to a Cartesian basis, since that will simplify the behavior of the operator D. 
One obvious Cartesian basis is {r, t - r, s - r}; but we shall make a somewhat more 
symmetric choice, as shown in Fig. 10.6. Let q := (r + s + t)/3 be the centroid of 
Arst, let a := q — s, and let 0 := q — t; we shall use the basis {q,a,/?}. Note 
that r = q+ a + £. In terms of this basis, the condition D(e) = 4rst becomes 
D{e) = 4(q+ a + 0){q - a)(q - 0) = 4(q 3 - q(a 3 + a0 + 0 2 ) + a0(a + (3)). Since 
U(q) = 1 but D(a) = D(/3) = 0, we find upon integrating that we must have 

e = q 4 - 2q 3 (a 2 + a/3 + 0 2 ) + 4qa/?(a + 0) + Z(a,/3), 
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Fig. 10.6. Two different bases for P* 

where Z(a>P) = Yli=o ZiOL 4 ~ % P % is the constant of integration. We are left with the 
task of showing that no such 4-tensor e is simple. 

If e were simple, its four factors would be points in P, which could be ex- 
pressed in the form (q + a,a + bi/3) for scalars and 6», with i in [1,4]. Note 
that, if we substitute P = 0 in the polynomial e(q,a,/?), we are left with the poly- 
nomial e(q,a,0) = q 4 — 2q 2 a 2 + zo<x 4 . Since this polynomial only involves q to 
even powers, we deduce that the four numbers {01,03,03,04} must have the form 
{+a 9 -a i +c,—c} for some a and c. Similarly, by substituting a — 0, we deduce 
that the four numbers {b\ y 63, 63, 64} must have the form {+6, -6, +o T , —d} for some 
b and d. 

At first blush, it might seem that the numbering of the and the numbering 
of the bi could be related in 4! — 24 different ways. A little thought shows that 
there are only two really different possibilities. In Case I, the sign-reversed pairs of 
o's line up with the sign-reversed pairs of 6's, leading to the factorization 

e = (q + aa + 6/?)(q -oa - bp)(q + ca + d/3)(q - ca - dp). 

In Case II, the sign-reversed pairs do not line up, leading to the form 

e = (q + aa + bp)(q -aa- dp)(q + ca + d/3)(q -ca- bp). 

We can rule out Case I right away: any such polynomial can't involve any terms 
with q to the first power, but our e includes both 4qa 2 /3 and 4qa£ 2 . Thus, only 
Case II remains as an active possibility. 

Of the 15 coefficients of e(q,a,/?), we know the values of 10 of them, while the 
remaining 5 are the unknown constants of integration z,-. Each of the 10 known 
coefficients gives us an equation that relates the four quantities a, 6, c, and d. Of 
these 10 equations, 5 turn out to be non-trivial: 

a 2 + c 2 = 2 from the q 2 a 2 term; 

b 2 + d 2 - 2 from the q 2 /? 2 term; 
(a + c)(b + d) = 2 from the q 2 a/? term; 
(o + c)(a - c)(b - d) = 4 from the qa 2 # term; 
(a - c)(b + d)(b - d) - 4 from the qap 2 term. 
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The last two of these relations imply that (a + c) = (b + d). In combination with 
the third relation, this implies that (a + c) 2 = 2. Since the first relation tells us 
that a 2 + c 2 = 2, we deduce that 2ac = 0. Hence, the two numbers a and c are 0 
and ±\/2 in some order. Similarly, the two numbers b and d must be 0 and ±\/2. 
But this implies that (a + c)(a - c)(b - d) must be ±2\/2, while the fourth relation 
states that it should be 4 instead. This contradiction completes the proof that no 
4-tensor e satisfying D{e) = 4rst can be simple. 

11. Degree lowering 

In the last section, we studied in some detail what happens when an n-ic is 
viewed as a degenerate case of an (n + l)-ic. In this section, we shall study a more 
general problem: how can we view an n-ic as an m-ic? If m > n, this involves 
raising the degree by m - n, If m < n, it involves lowering the degree by n - m. 
Of course, we can't lower the degree in general without changing the function. If 
F is an n-ic function, lowering the degree of F to m means approximating F in 
some sense by an m-ic function G. The sense of approximation that we will use is 
osculation at a fixed point. That is, we shall choose a point r in the domain space 
P, and we shall choose G to be the unique m-ic function that osculates F to mth 
order at r. Note that, even in the degree raising case, where m > n, it is still true 
that the output m-ic function G osculates the input n-ic function F to mth order 
at r. When m > n, however, we have G = F, independent of the location of r. 

The problems of degree raising and degree lowering can be tackled with either 
the coordinate-based or the coordinate-free approach. As it happens, we have al- 
ready studied these problems from the coordinate-based approach, arriving at the 
answer described in Prop. 8.6. The following proposition restates that answer in a 
more symmetric fashion. 

Proposition 11.1: Coordinate-based raising or lowering. Let F:P — > Q be 
an n-ic map, Jet r be a point in P f and Jet {8%, . . . , 8 P } be a basis for Po- The m-ic 
map G:P -> Q that osculates F to mth order at r is given by the formula 

mht(S[ l -^r— *) = nif?(8[*---8;>T"-% 

where < = •! + ••• + t p and t lies in [0, m]. 

Proof. Prop. 8.4 tells us that the left-hand and right-hand sides are corresponding 
values of the tth derivatives of F and G at r. Note that, if m > n, the formula 
correctly specifies that the tth derivative of G should be identically zero whenever 
t > n. □ 

In the coordinate-free approach, degree raising and degree lowering come out 
quite differently. The coordinate-free approach to degree raising is an easy gener- 
alization of Eq. (10.1), but the coordinate-free approach to degree lowering is more 
complicated. 

Proposition 11.2: Coordinate-free raising. Let F:P — ► Q be an n-ic map, and 
let m > n. The m-ic map G:P — ► Q that results from viewing F as a degenerate 
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m-ic is given by 




(11.3) 



|/|=n 



Proof. Eq. (11.3) clearly defines a symmetric m-affine function g with the correct 
diagonal values. Alternatively, we could derive Eq. (11.3) by applying the total 
tensor differentiation operator D of Prop. 10.2 repeatedly, a total of m times. □ 

Proposition 11.4: Coordinate-free lowering. Let F:P -+ Q be an n-ic map, 
and let m < n. The m-ic map G:P — ► Q that osculates F to mth order at the point 
r in P is given by 

^(u 1 ...n m )=(^) £ (.)* ' /•( P * + i n u,) f (11.5) 

Vm/ JC{l,...,m} * + / V •€/ 7 

Jb=m-|/| 

where / := n — m denotes the amount fay which the degree is being lowered. 

Proof. Suppose that we begin with an m-ic map G and that we produce a degenerate 
map F of degree n by raising the degree of G by /. For such an F, the m-ic map 
that osculates F to mth order at r is just the map G that we started with, and it 
doesn't depend on r. Let us first check that Eq. (11.5) gives the correct answer in 
this special case. The blossom / of the degree-raised F will be given in terms of the 
blossom g of G by Eq. (11.3), but with the pairs (f y g) and (n,m) interchanged. To 
avoid confusion, let us write out the interchanged version: 

/*(«!•••»«) = 4r E *®(rR). (n.6) 
v»J /c{i „> x *e/ y 

|/|=m 

If we substitute Eq. (11.6) in for each occurrence of /* in Eq. (11.5), we should get 
massive cancellation, leading to the tautology ff®(u x • • -u m ) = g 9 (ui • • -u m ). 

To verify that this does happen, let /' C {l,...,m}, let k' := m - \I'\, and 
consider the term 

T:=Wr*'rK). 

V »€/' 7 

When we substitute Eq. (11.6) into Eq. (11.5), what multiple of T do we end up 
with? The term corresponding to / C {1, . . . , m} and A: = m - |/| in Eq. (11.5) will 
contribute a multiple of T precisely when i" D In particular, such an /-term will 
contribute 

K 1 k + l\ k' J 

since, when we apply Eq. (11.6) to the term /®(r*+' n, G/ «i), we have to choose 
which k' out of the k + I factors of r to keep. The number of sets / that satisfy 
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V C / C {1, . . . ,m} and have size |/| = m — Jfe is (^'). Therefore, the coefficient of 
T in the final result is given by the binomial coefficient sum 

0<k<k* x / \ / 

If k' = 0, which implies /' = {1, . . . , m}, this sum evaluates to 1 trivially; that case 
gives us the desired right-hand side g® (ui • • • u m ). If k' > 0, the sum evaluates to 
0 by straightforward binomial-coefficientology, which is left as an exercise. Thus, 
Eq. (11.5) does produce the correct answer when the F on the right-hand side was 
generated by degree raising. 

To handle the general case, let F be any n-ic map, and let G be the m-ic map 
that osculates F to mth order at r. If we raise the degree of G by /, we get an 
n-ic map H that also osculates F to mth order at r. We have just verified that, if 
we apply Eq. (11.5) to H, rather than F, we do get back G. But note that every 
blossom application on the right-hand side of Eq. (11.5) involves r as an argument 
at least / times. Since H and agree to mth order at r, we can apply Prop. 8.8 to 
deduce that their blossoms h and / will agree on all argument bags that include at 
least / copies of r. Hence, applying Eq. (11.5) to F will also return G, as claimed. □ 

By the way, if we know that the n-ic map F is actually of degree m as well as 
being of degree n, we can simplify Eq. (11.5) somewhat. Note that, in this case, 
the point r is a free parameter: it doesn't matter where we decide to approximate 
F with an m-ic since F itself is an m-ic. We can reduce the number of terms in 
Eq. (11.5) by choosing an appropriate value of the free parameter r. For example, 
if we choose r :— u l5 we get, after a little algebra, a formula with only 2 m_1 terms 
instead of 2 m terms: 

,>-•»-)=(:) £ jjM^tA^u^ (ii - 7) 

fc=m-|/| 

where / :— n — m is, once again, the amount of lowering. (A technical point: in the 
special case m = 0, this trick of setting r := Ui doesn't work, because there is no 
such thing as Ui.) 

12. Bipolynomial surfaces 

The graphics community recognizes two different types of surfaces defined by 
polynomials, and we have been focusing entirely on one of those two types so far. 
In this section, we shall redress that imbalance. 

Let F: R x R — ► Q be a polynomial function defined on the two-dimensional 
parameter space R x R. By definition, this means that each coordinate of the point 
F(u,v) is given by a polynomial in the two real variables u and v. There are two 
different types of degree bound that we could put on these coordinate polynomials. 
In a tensor product surface of degree (m;n), the coordinates of F(u,v) are required 
to have degree no higher than m in the variable u and degree no higher than n in 
the variable t; separately. In a triangular patch surface of degree n, the coordinates 
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of F(u, v) are required to have total degree no higher than n in the variables u and 
v jointly. 

The phrases "tensor product surface" and "triangular patch surface" are the 
current standard names in computer-aided geometric design for these two types 
of surfaces [5], In this paper, however, we shall uncover good reasons why both 
of those names are poor choices. Fortunately, in the latter case, we have already 
become comfortable with a better name: a "triangular patch surface" of degree 
n is precisely what we have been calling simply a polynomial surface of degree n. 
That is, if we place a bound on the total degree of the coordinates of F(u, v), the 
resulting condition is independent of the choice of affine frame for the domain space 
P = R x R, and we can reinterpret F as a polynomial function F({u>v)) = F(u) 
defined on a two-dimensional affine space P. 

In "tensor product surfaces" of degree (m;n), on the other hand, we place 
separate bounds on the degrees of the coordinates of F(u, v) in u and in v. The 
resulting condition on F does depend on the product structure of the domain R x R. 
For any fixed v, the map u v) is a polynomial function of degree m; for any 

fixed u, the map v F(u,v) is a polynomial function of degree n. Thus, it is 
quite natural to refer to F itself as a bipolynomial function of degree (m;n), in 
particular, a bipolynomial surface. We shall use that name as our replacement for 
"tensor product surface." Note that, when both degrees m and n happen to be 3, 
it is already standard practice to refer to the resulting surface as a bicubic, so the 
name "bipolynomial" shouldn't sound too strange. 

Note that every polynomial surface of degree n is also a bipolynomial surface 
of degree (n;n). Similarly, every bipolynomial surface of degree (m;n) is also a 
polynomial surface of degree m + n. Thus, the distinction between the two types of 
surfaces arises only when we enforce some bound on the degree. 

Blossoming, tensoring, and homogenizing all apply to bipolynomial functions 
F: U xV — ► Q of degree (m; n) pretty much the same way that they did to polynomial 
functions. In the remainder of this section, we shall sketch that development briefly, 
concentrating on the differences between the polynomial and bipolynomial theories. 
For brevity, we shall discuss only the case where both of the domain spaces U and 
V are affine lines. The theory works perfectly well when U and V have dimension 
greater than one, but those cases don't come up often in computer-aided geometric 
design. If there were need, we could also generalize the results below to apply to 
tripolynomial functions F: U x V x W -> Q, or, more generally, to multipolynomial 
functions F: Yl 4 Ui -> Q. 

A bipolynomial surface F:UxV Q of degree (m; n) can be viewed, in Curried 
fashion, as a higher-order function F:U (V — ► Q) 9 that is, as an m-ic curve of 
n-ic curves. More precisely, let Poly n (V,Q) denote the set of all polynomial curves 
G: V — ► Q of degree n, turned into an affine space by taking affine combinations 
pointwise. Note that, if V and Q are finite-dimensional, then Poly n (y,Q) will be 
finite-dimensioned also, so we aren't violating our prohibition against infinitely many 
dimensions. Saying that F is "an m-ic curve of n-ic curves" is really saying that we 
can view F as an m-ic function F:U -> Poly n (y,Q). Vice versa, we could also view 
F as an n-ic function F:V Poly m (Z7,Q). This Curried point of view is worth 
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Fig. 12.1. The six guises of a bipolynomial map 



keeping in mind, since it can often be used to reduce a bipolynomial problem to a 
combination of two polynomial problems. 

Fig. 12.1 provides an overview of how our three principles transform a bipoly- 
nomial surface, analogous to the overview in Fig. 5.1 of the transformations of a 
polynomial function. Given a function of any one of the six types, there exist unique 
functions of the other five types that correspond. We shall begin our whirlwind tour 
by moving down the left column. But before we do so, there is a notational ques- 
tion to address. In this paper, we have chosen to distinguish between a point f in 
the affine line L and a scalar r in the coefficient field R. Now that there are two 
different affine lines U and V involved, we have to invent some new notation to 
distinguish points in U from points in V '. We shall use slanted bars: if r is a scalar, 
let f , read "r-in," denote the point with coordinate r in the line {/, and let r, read 
w r-out, M denote the point with coordinate r in the line V. Thus, a typical point on 
the surface F will be written F(u,v). 

The multiaffine blossom of a bipolynomial surface F: U x V — ► Q of degree 
(m; n) is a multiaffine function / : U m x V n — ► Q that is symmetric in its first 
m arguments, symmetric in its last n arguments, and satisfies the correspondence 
identity F(ii, v) = /(u, . . . , u; v, . . . , v). We shall abbreviate this type of symmetry 
condition by saying that / is (m; n)- symmetric. And we shall use a semicolon instead 
of a comma to help separate the first m arguments of / from its last n arguments. 
A variant of the Blossoming Principle tells us that every bipolynomial surface has 
a unique multiaffine blossom; this variant can be proved either from scratch or by 
viewing F in Curried fashion and applying Prop. 2.1 twice. 
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Don't get confused between the multiaffine blossoms of the two common types of 
surfaces. The multiaffine blossom of a bipolynomial surface F: U x V — ► Q of degree 
(m; n) takes, as arguments, m points in U and n points in V. If we think in terms 
of coordinates, that is an ordered pair of sequences of real numbers. The multiaffine 
blossom of a polynomial surface F:P — ► Q of degree n takes, as arguments, n points 
in the plane P. If we think in terms of coordinates, that is a sequence of ordered 
pairs of real numbers. 

The Bezier theory of a bipolynomial surface starts by choosing reference inter- 
vals [p, q] for U and [r,s] for V. Each of the first m arguments of / can then be 
expressed as an affine combination of p and q, while each of the last n arguments 
is an affine combination of r and s. If we express all m + n arguments of / in this 
way and then invoke the multiaffineness of /, we get a formula that expresses the 
blossom value /(ui, . . . , u m ; t;i, . . . , t/ n ) as a sum of 2 m + n terms, each of which is 
some coefficient times a Bezier point of the rectangular surface patch F([p y q] X [r, $]), 
where the Bezier points are the (m + l)(n + 1) blossom values 

/(p^rP, q^j^q; <V^r, a^J) 

m—i % n— j j 

for t in [0, m) and j in [0, n]. 

The de Casteljau Algorithm for a bipolynomial surface F computes the blossom 
value /(ui,. . . , it m ; tJi, . . . , t/ n ) by performing m + n stages of linear interpolations, 
starting with the Bezier points. Each blossom argument provides the interpolation 
ratio for one of those stages, and the arguments can be used in any order. Fig. 12.2 
shows an example of a bipolynomial surface patch of degree (2; 1) defined on the 
rectangle [0,3] x [0,3], along with all of the construction lines that might be used 
when locating the point F(i y $) = 

We can also generalize from using Bezier frames to using de Casteljau frames. 
Let (pi, . . . ,P2m) be a reference sequence of 2m points in U that satisfies the con- 
dition p m +i 7^ p 3 for 1 < t < j < m and, similarly, let (r X} . . . , f 2n ) be a sequence of 
2n points in V that satisfies the condition r„+» ^ ry for 1 < t < j < n. Then, the 
bipolynomial surface F can be uniquely specified by arbitrarily specifying its poles, 
which are the blossom values /(pt+i, . . . ,p»-f m ; ry+i, . - . , ry+ n ) for i in [0, m] and j 
in [0,n]. 

Moving down the left-hand column of Fig. 12.1, we next come to the affine 
blossom /®. We shall begin the construction of the affine blossom /® by view- 
ing the multiaffine blossom / in a Curried fashion. Letting the {/-arguments vary 
first, and then the V-arguments, we can turn / into a symmetric, m-affine func- 
tion g: U m SMA(7 n ,Q), where SMA(V n ,Q) is the affine space consisting of all 
symmetric, multiaffine functions from V n to Q. Applying the Tensoring Principle 
from Prop. 4.5, we can convert g into an affine function g® : Z7® m — ► SMA(V n ,Q). 
Switching things around and letting the V-arguments vary first, we can view as a 
symmetric, n-affine function h: V n -* Affine(Z7® m ,Q). Applying Prop. 4.5 again, we 
get an affine function h®:V® n -+ Affine(U® m ,Q). Switching back to non-Curried 
form, we have converted / into a biaffine function /®:®: Z7® m x V® n — ► Q, which 
satisfies the identity f(ii u . . . , u m ; i u . . . , t; n ) = (u x . . . u m , v x • - • v n ). 
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Fig. 12.2. The de Casteljau Algorithm for a bipolynomial surface of degree (2; 1) 

We need a new variant of the Tensoring Principle in order to finish the job, to 
turn the biaffine function /®»® into an affine function The problem is that the 
biaffine function /® ; ® is not symmetric; indeed, its two arguments have different 
types, so it doesn't even make sense to ask whether it is symmetric. Hence, we need 
an asymmetric variant of tensoring. 

Exercise 12.3: The Tensoring Principle, asymmetric affine variant. If S 
and T are affine spaces, construct a space S® T with the property that biaffine maps 
g with domain S x T are equivalent to affine maps g® with domain S ®T under 
the correspondence g(s y t) = </®(s ® t), where the mapping (s,t) *-* s ® t is a fixed 
biaffine mapping from 5 x T to S®T. Hint: Consider -XT-polynomials in the formal 
variables X % for s in S and Y t for t in T. Choosing affine frames for S and T, define 
the normal form of an -XT-polynomial, and define when two -XT-polynomials are 
affinely equivalent. One model for S®T is then the space of affine equivalence classes 
of 1-flavored XT-polynomials that are (1; l)-homogeneous, that is, homogeneous of 
degree one in the X- variables and also homogeneous of degree one in the F-variables. 
In this model, we have 8 ® t := [<X fl Yt]. 
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Some comments on Ex. 12.3. First, the reason for using two different formal 
symbols X and Y is to allow for the case S = T, that is, to enable the construction 
of S ® S. Note that we must distinguish between Si ® 82 = Y» 3 ] and s 2 ® Si = 
[X^y.J in the case S = T, since there is no guarantee that <7(si,s 2 ) = <Ks2>Si)> 
This lack of commutativity also explains why we don't use simple multiplication 
"8182" to denote the tensor product of elements in the asymmetric variant. Warning: 
with our conventions, the spaces S ® S and S® 2 are not the same; symmetry is 
implied in the latter case, but not in the former. 

Second, the hint in Ex. 12.3 is phrased so as to emphasize the parallels with the 
construction of the symmetric tensor power space P® n . For the asymmetric tensor 
product S ® T of two spaces, the only formal polynomials that are really used are 
(1; l)-homogeneous, so it wouldn't be hard to do without polynomials entirely. If one 
does choose to use formal polynomials, however, there are interesting connections 
with yet another variant of the tensor-product construction: the tensor product 
of algebras. In particular, the algebra of XK-polynomials is actually the tensor 
product X[S] ® Y [T] of the algebras X[S] and Y[T]. If we form equivalence classes 
by rewriting both X-variables and V-variables in a normal form, the result is the 
tensor product X[S] ® y[T] of the algebras X[S] and l/[T], whose elements are 
tensors on (S;T). The tensor product S® T consists of the 1-flavored (1; l)-tensors 
on (5;T). 

Our motivation for Ex. 12.3 was that we had transformed the (m; n)-ic surface 
F: Ux V -* Q in Fig. 12.1 into an asymmetric biaffine function /®J®: U® m X V® n — ► 
Q. Applying Ex. 12.3 finishes the job, converting /® ; ® into the affine blossom of 
F y which is the affine function /®:17® m ® V® n Q that satisfies the identity 
/(ui, . . . , u m ; t$i, . . . , v n ) = /®(ui • • • u m ® vi • - • v n ). Actually, if we apply Ex. 12.3 
in the most straightforward way, setting 5 := t/® m and T := V® n , we end up with 
an iterated tensor space as the domain. An equivalent but simpler method is to set 
S :- U and T := V and then to consider Z7® m ® V® n as consisting of the 1-flavored 
(m; n)-tensors on {U;V). Note that the algebra X[Z7]®y[V] of all tensors on (U;V) 
is a commutative algebra: if ai ® 61 is a (fci;/i)-tensor on (U; V) and a 2 ® b 2 is a 
(Ar 2 1/2)- tensor on (U; V)> then their product is (ai®6i)(a 3 ®6 2 ) = aia^bib^^ which 
is a (k x + Ai2;/i + /2)-tensor on (U; V). The only noncommutative multiplication in 
sight is the one that takes a tensor on U and a tensor and V and turns that pair into 
a tensor on (U; V). That noncommutative operation on elements is the one that we 
are writing with a ®". 

The asymmetric tensor product operation on spaces that corresponds to the big 
a ®" in the domain (7® m ®V® n of the affine blossom is the instance of tensoring that 
gave bipolynomial surfaces the name "tensor product surfaces." Note that, from our 
point of view, there is nothing about bipolynomial surfaces that connects them more 
intimately than polynomial surfaces to the concept of the tensor product. The only 
difference is that, in the bipolynomial case, one of the tensor products involved in 
building the domain J7® m ® V® n of the affine blossom is asymmetric, while, in the 
polynomial case, all of the tensor products involved in building P® n are symmetric. 
This observation suggests that the name "tensor-product surfaces" is a poor choice. 

The right-hand column in Fig. 12.1 demands only a few remarks. The ho- 
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mogenized bipolynomial map F* is called flavor-biexponentiating, meaning that 
Flav(i^(a,6)) = (Flav(a)) m (Flav(6)) n . The domain of the linear blossom ff is 
most simply thought of as the set of all (m; n)-tensors on (U\V) of any flavor. 
That precise set doesn't have an abstract name by our conventions; but there are 
several abstractly-named sets that are canonically isomorphic to that set. One is 
(£7® m ® V® n )*, the linearization of the domain U® m ® V® n of the affine blos- 
som. Another is 17® m ® V^? n , in which the big "®" denotes the asymmetric tensor 
product of linear spaces in the linear world, the same variant of the tensor-product 
construction that arises in textbooks on linear algebra. 

Exercise 12.4. Let S and T be affine spaces of dimensions s and t and let W 
and Z be linear spaces of dimensions w and z. Verify the following four formulas 
(one of which is just Prop. 4.9): dim(5 ® T) = st + s + t; dim(S® n ) = (*+ n ) - 1; 

dim(W ® Z) = wz; and dim(W® n ) = 

Just as in the polynomial case, one advantage of the homogenized blossoms is 
that we can compute derivatives by replacing some of the points among the blossom 
arguments with vectors. If we let /i denote the unit vector on U and v denote the 
unit vector on F, we have 

|^|^(u,v) = m±ni/»(4— V®*"- 1 *')- (125) 

The falling-factorial coefficients are once again annoying, but we can wash out their 
effects if we consider osculating flats: the osculating flat Oscfc ; j F(it, v) is precisely the 
locus of the points /®(c(u m ~ Ac ® t) n ~~')) as e varies over all 1-flavored (k; /)-tensors 
on (U;V), that is, as e varies over U® k ® V® 1 . This implies that, if it appears 
p times among the arguments to / and v appears q times, the resulting blossom 
value must lie in the flat Osc m _ p;n _ q F(ii y v). For a nondegenerate bipolynomial 
surface F } this differential perspective allows us to compute the blossom value 
/(ui, . . . , u m ; tJi, . . . , v n ) by intersecting the osculating flats corresponding to all 
distinct pairs (xii,Vj). 

The last topic that we will tackle for bipolynomial surfaces is the geometry 
of a surface and its multiaffine blossom. As in the case of polynomial curves and 
surfaces, the best way to study this question is to back up through the affine blossom 
and to study instead the geometry of its domain space U® m ® V® n . Tensors in 
this space of the form u m ® v n correspond to points on the surface F\ let us call 
such tensors perfect (m;n)th powers. Simple tensors, that is, tensors of the form 
«i • • • «m ® t5i • • • v n , correspond to points for which we have multiaffine labels. 

Things get non-trivial as soon as the total degree m + n exceeds one; consider 
the case m = n = 1. A bipolynomial surface F of degree (1;1) is actually itself 
biaffine. Hence, F is equal to its multiaffine (1; l)-blossom /: F(ii, v) = /(u; v). For 
simplicity, let use choose [0, 1] and [0, 1] as our reference segments in U and V. The 
surface patch F([6, 1] X [6, 1]) has the four Bezier points /(6;6), /(6; 1), /(I; 6), and 
/(i; 1). If those four points are affinely independent, the resulting surface F will be 
a hyperbolic paraboloid sitting in a 3-space, with the two families of lines /(it; •) 
and f{-\v) as its generators. 
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This is precisely the geometry that is revealed by a study of the domain i/® 1 ® 
V® 1 == U ® V of the affine blossom /®. Choosing the Cartesian bases {6,/i} for *7* 
and {0,*/} for V*, a general (1; l)-tensor e on {U\ V) has the normal form 

e = a6®6 + &6®i/ + cji®6 + d/i®i/, 

where a = Flav(e). The set U ® V consists of the 1-flavored (1; l)-tensors, and is 
hence 3-dimensional, with (6,c,d) as a Cartesian coordinate system. The simple 
tensors in U ® V are the products u ® v of one point in each space and, in the cause 
m — n = 1, every simple tensor is also a perfect (1; l)th power. The normal form 
/3(e) of the simple tensor e — u ® v is given by /3(u ® v) = (0 + u/x) ® (6 + vi/) = 
(ti,tf,uv). A tensor c = (6,c,d) in U ® V will factor in this form only if d = be; it 
is this equation that generates the curved shape of the hyperbolic paraboloid. One 
example of a compound tensor in U ® V is given by (6 ® 6 + i ® 1 )/2. Since the two 
Bezier points /(0;0) and /(l; 1) both lie on the paraboloid, but the line connecting 
them is not one of the generating lines, the midpoint of that line does not lie on the 
paraboloid. 

13. Degree splitting and degree joining 

The theory of bipolynomial surfaces gives rise to two new problems like degree 
raising, where we cam study the relationship between two different blossoms of the 
same function. First, we can view a polynomial surface of degree n as a degenerate 
case of a bipolynomial surface of degree (n;n). We shall call this adjustment of 
the degree degree splitting. Second, we can view a bipolynomial surface of degree 
(fc;/) as a degenerate case of a polynomial surface of degree k + L We shall call 
this adjustment degree joining. In this section, we shall derive formulas for degree 
splitting and degree joining, using both the coordinate-based and the coordinate-free 
approaches. 

We shall begin by reconsidering degree raising and degree lowering, but for the 
particular case of surfaces. Let P = U x V be a parameter plane with a product 
structure, and let F:P — ► Q be a polynomial or bipolynomial surface. For any 
nonnegative integers t and j, we can imagine computing the (t;i)th derivative 

3i a >«»,.» 



du % dv' 



of F as a function of the position (u,v). If F is a polynomial surface of degree n, 
this derivative will be identically zero whenever i + j > n. If F is a bipolynomial 
surface of degree (A;;/), this derivative will be identically zero when either i > k 
or j > I. If we associate the (*; j)th derivative of F with the lattice point {•,,;) 
in some plane A, we can think of each type of surface as a geometric figure in A. 
The possibly-nonzero derivatives of an n-ic polynomial surface form an isoceles right 
triangle T n in A, while the possibly-nonzero derivatives of a (Jfe; /)-ic bipolynomial 
surface form a rectangle R^h The processes of degree raising and degree lowering 
move around the borders of these A-figures. 

To begin with, consider an n-ic polynomial surface F> whose A-figure is the 
triangle T n . Raising the degree of F corresponds to moving the hypotenuse of this 
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triangle outward to form T m for m > n, choosing the output m-ic surface G to 
be degenerate in the sense that all of its derivatives in the strip T m \ T n are zero. 
Lowering the degree of F corresponds to moving the hypotenuse inward to form 
T m for m < n. In the process, we are forced to replace F by a simpler surface G, 
which osculates F to mth order at a chosen point r, but whose derivatives in the 
strip T n \ T m are zero rather than matching those of F. The formulas in Section 11 
tell us how to raise or lower the degree of a polynomial surface, using either the 
coordinate-based or the coordinate-free approach. 

Bipolynomial surfaces F of degree (k]l) have two different degrees, each of 
which can be raised or lowered. Raising one of the degrees moves one edge of the 
rectangle R^i away from the origin, while lowering that degree pulls that edge back 
towards the origin. The formulas in Section 11 can also be used to raise or lower 
either degree of a bipolynomial surface: while we work on one group of variables, 
the other group just comes along for the ride. 

Thus, the formulas in Section 11 suffice for performing any adjustment to the 
degree of a surface that preserves the shape of the A-figure; they can take any trian- 
gle to any triangle, or any rectangle to any rectangle. It also makes sense, however, 
to convert back and forth between triangles and rectangles, that is, to osculate a 
polynomial surface with a bipolynomial surface or vice versa. The same rules apply: 
Points in the output A-figure that weren't in the input A-figure correspond to new 
derivatives that we choose to set equal to zero. If any of these exist, they are a 
source of degeneracy in the output surface. Points in the input A-figure that aren't 
in the output A-figure reflect behavior of the input that is of too high an order to be 
captured in the output. If any of these exist, the output will only osculate, rather 
than equal, the input. 

Degree splitting is the transformation T n Rk ; i, which osculates a polyno- 
mial surface with a bipolynomial one. Degree joining is the reverse transformation 
Rk,i ^ T n . The following proposition describes degree splitting and joining from 
the coordinate-based approach. 

Proposition 13.1: Coordinate-based splitting or joining. Let P = U xV be 
an affine plane with a product structure; let r = (r u , r v ) be a point in P; let \i and 
v denote the unit vectors on U and V; let F:P — ► Q be a polynomial surface of 
degree n; and letGiU x V — ► Q be a bipolynomial surface of degree (k; I). Consider 
the formula 

If this formula holds for % < k and j < /, then G is the (A:; /)-ic surface that osculates 
F to order (Jfe;/) at r. if the formula holds for i + j < n, then F is the n-ic surface 
that osculates G to order n at r. 

Proof. By Prop. 8.4 and Eq. 12.5 respectively, the left-hand and right-hand sides of 
that formula express the (t; j)th derivatives of F and G at r. □ 

The coordinate-free approach is more complicated, and we shall therefore adopt 
a multi-step strategy. Note that a general degree joining of the form Rk t i T n can 
be achieved in two steps: Rk ; i Tk+i T n . In the reverse direction, we can achieve 
a general splitting T n *-* Rk ; i in three steps: T n h-> Rn ;n ^ Rk t n Rk;i- Since we 



13.3 



DEGREE SPLITTING AND DEGREE JOINING 



77 



already know how to raise and lower degrees, it suffices for us to study the particular 
adjustments R k] i *-> T k+i and T n i-> R n . n . Of course, if we build the formula for 
a general degree adjustment by using a multi-step strategy like one of these, the 
formula that results will probably be more complicated than necessary. But such 
strategies do provide, at least in principle, a way of building up any adjustment of 
the degree of a surface. 

The particular adjustments R k;i h-> Tk+i and T n i-* R n;n both have simple 
coordinate-free formulas. In the first case, we sum over partitions; in the 

second case, we sum over n! permutations. 

Proposition 13.2: Coordinate-free joining. Let F: U x V — ► Q be a bipolyno- 
mial surface of degree (A:;/), let f be the multiafEne (k] I) -blossom of F, and let g 
be the multiafRne (k + l)-blossom of F, that is, the multiafEne blossom of the map 
G that results from viewing F as a degenerate polynomial surface of degree k + L 
The (k + I) -blossom g is given, in terms of the (k; l)-blossom f, by the formula: 

/UJ = {1 fc+J) 

l/l=fc, \J\=l 

Proposition 13.3: Coordinate-free splitting. Let F:P — ► Q be a polynomial 
surface of degree n, and suppose that the parameter piane P has a product struc- 
ture P = U x V. Let S n denote the symmetric group on n letters, the set of all 
permutations of the set {1, . . . , n}. The (n; n)-blossom g of F is given, in terms of 
the n~ blossom /, by the formula: 



Part E: Spline Curves 



It is time for a major change in focus. We have seen so far that blossoms are a 
useful tool for working with polynomial curves and surfaces. But a single polynomial 
curve or surface isn't a very complicated thing no matter how you think about it. 
The real power of the blossoming technology becomes apparent only when it is used 
to analyze splines. In particular, as de Casteljau discovered [14], blossoming extends 
to parametrically continuous spline curves with surprising ease and elegance. 

14. On blossoms and joints 

One comment before we begin: mathematicians generally think of a spline as 
a real-valued function, while we will be thinking of splines as taking values in an 
arbitrary affine object space Q, At some level, there isn't much difference between 
these two approaches. In particular, a function F(u) is a spline in our sense if and 
only if each coordinate of F(u) is a spline in the mathematician's sense. For our 
purposes, there are two advantages to dealing with a general object space Q rather 
than restricting ourselves to the real numbers R. First, we are then dealing more 
directly with the objects of interest in computer-aided geometric design. Second, 
it is easier to draw informative figures when the object space Q has at least two 
dimensions. An n-ic polynomial curve is degenerate whenever the affine span of its 
range is a flat of dimension less than n. An n-ic polynomial curve F: L — ► R, with 
values in a one-dimensional object space, is so degenerate that it is impossible to 
draw reasonable pictures of it. Instead, one must resort to alternative strategies 
such as drawing the graph of F y which is the polynomial curve G: L — ► given by 
G(u) := (u,F(u)). 

Let L be the affine line, and let Q be an arbitrary affine object space. We shall 
suppose that the line L has been partitioned into intervals by a certain increasing 
sequence of points {£*}> which can be finite, infinite, or bi-infinite. For each t, we 
shall also suppose that we are given a certain polynomial map FiiL — ► Q of degree 
no greater than n. We can assemble an n-ic spline curve F out of the given n-ic 
curves Fi by specifying that F(u) := F»(u) whenever U < u < t» +1 . If the curves 
Fi-i and Fi happen to agree at the point F,*, it is natural to specify that the spline 
F should also share that common value; but if Fi-i and Fi do not agree at t,, we 
shall consider F[Ii) to be indeterminate. Note that, from a formal point of view, 
a spline curve is the same thing as a piecewise polynomial curve. The difference is 
that, when dealing with spline curves, we usually enforce some sort of continuity 
conditions at the joints between adjacent segments. 

If F:(¥ t S) — ► Q and G:(s y i) — ► Q are polynomial curves defined on adjacent 
intervals in P, there are two different types of continuity constraints that we could 
impose on the joint at I between F and G. This joint is called parametrically 
continuous of order Jfe, or C k continuous, if the zeroth through fcth derivatives of 
F and G agree at 5, that is, if the spline curve formed by combining F and G is 
C k continuous over the entire interval (r,F). Parametric continuity is the simplest 
notion of continuity, and it is the natural notion to use in situations where the 
parameterizations of the curves are important as well as their shapes. 
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In many cases in computer-aided geometric design, however, only the shapes 
are important. In such cases, we can use a more liberal notion of continuity. The 
joint at 5 between F and G is called geometrically continuous of order k or G k 
continuous (or visually continuous of order k or VC k continuous) if F and G can 
be reparameterized in a neighborhood of 8 in such a way that the reparameter- 
ized curves join with C k continuity [15]. (One technical detail is that only regular 
reparameterizations should be allowed, that is, reparameterizations that are locally 
invertible.) Allowing reparameterization leads to a more purely geometric notion of 
continuity. For example, G 1 continuity for a curve means continuity of slope, while 
C l means continuity of velocity; G 2 means continuity of curvature, while C 2 means 
continuity of acceleration. (Lest the reader be seduced by the apparent pattern, it 
is important to note that there are more than two choices of continuity conditions 
for order three and above. Parametric, C 3 continuity is continuity of jerk, that 
is, continuity of the third vector derivative of position with respect to time. The 
reparameterization allowed in geometric continuity makes G s continuity a weaker 
condition than C 3 . But continuity of torsion is an even weaker condition than G 3 , as 
Bohm points out in a recent paper [4].) Geometric continuity is an interesting area. 
Unfortunately, as we shall discuss in Section 20, I haven't been able to figure out a 
good way to handle geometric continuity with the blossoming technology. For now, 
we shall restrict ourselves to spline curves with parametric continuity constraints. 

Parametric continuity and blossoming were made for each other. In particular, 
Prop. 8.8 tells us that the two n-ic curves F: (f y s) Q and G: (s,f) — ► Q join with 
C k continuity at § if and only if the blossoms / and g agree on all argument bags 
that include at least n — k copies of § , that is, if and only if /®(iti • • • Uk§ n ~ k ) = 
g®(iii • • • u k s n ~ k ). Thus, each new order of parametric continuity means that one 
more blossom argument can be varied without destroying the agreement between 
the blossoms of the joining curves. If F and G meet with only C° continuity, we 
can't vary any of the arguments away from 5 without destroying agreement; all 
we know is that F(s) = /®(s n ) = g®(s n ) = G(s). If F and G meet with C n 
continuity, so that they are actually identical, we can vary all n arguments to the 
blossoms without destroying agreement. 

Fig. 14.1 shows examples of joining cubic segments with various orders of con- 
tinuity. Note that, when the joint between F and G has C k continuity, we have 
f®(ri = 0®(r*t'5 n -»-J) whenever i + j < k. The points given as values 
of both / and g by these equalities form a de Casteljau Diagram with k shells, which 
is indicated in Fig. 14.1 in bold. We can interpret this diagram in three different 
ways: either it extrapolates forward from the last A; + 1 Bezier points of ^([f,5]) 
to tell us where the first k + 1 Bezier points of G([s,£]) must be; or, vice versa, it 
extrapolates backwards from G to F; or, more symmetrically, it constrains both the 
last k + 1 Bezier points of F([r,s]) and the first A: + 1 Bezier points of G([s y i]) so 
as to guarantee C h continuity at the joint. 

Spline aficianados have a different way of measuring the smoothness of the 
joint between two curves [8]. Rather than counting the number of derivatives of the 
joining curves that are guaranteed to agree, they count the number of derivatives, 
from the nth on down, that are allowed to disagree. In particular, suppose that 
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Fig. 14.1. Two cubic curves joining with C°, C 1 , C 2 , and C 3 continuity 

the n-ic curves F:(f,§) — ► Q and G:(5,f) — ► <9 have a C k joint at the point 8. 
The parameter value 3 at which the joint occurs is called a knot, and the number 
m :~ n — k of derivatives that are allowed to be discontinuous as we pass through the 
knot is called the multiplicity of the knot. This convention works out well because 
a knot of multiplicity m behaves very much like the limit of a cluster of m closely 
spaced single knots that have coalesced. Rephrasing Prop. 8.8, we observe that the 
multiplicity of a knot is precisely the number of blossom arguments that must be 
kept fixed at that knot in order to guarantee that the blossoms of the two joining 
curves will agree. 

These conventions about knot multiplicities allow us to specify all of the bound- 
ary conditions of a spline curve by means of one sequence of points {£,}, called the 
knot sequence, which consists of all of the knots repeated according to their multi- 
plicities and sorted into non-decreasing order. If the common value t t +i = £,+2 = 
• • • = *i+ m is a knot of multiplicity m, then the curve Fii (f»,F»+i) — ► Q is required 
to join the curve *<+ m : (f»+ m ,f» +m +i) — ► Q at that knot with C n ~ m continuity. In 
this way, a knot sequence specifies both the locations and the smoothness levels 
of the joints allowed in a spline curve. 

The knot sequence is just what we need to understand the extent to which the 
blossoms of two segments of a spline curve are guaranteed to agree. In particular, 
we can show that the argument bags on which lots of blossoms must agree are the 
bags that contain substrings of the knot sequence. 

Let {ii} be the knot sequence of an n-ic spline curve F(u). For each t such that 
i% < ii+x , the behavior of F(u) for ti in (t iy U+i) determines an n-ic polynomial curve 
Fi(u), and that curve has an n-blossom /t(«i, . . . , u n ). For simplicity at first, let us 
suppose that there are no multiple knots. Consider the values of the various blossoms 
when applied to an argument bag of the form B = {f t +i>. . . ,f»+i,uj+i,. . . ,u n }. 
Since the bag B includes the simple knot the blossoms /» and must agree 
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on B. Since B also includes t;+ 2 > the blossoms / t +i and /»+2 must agree on B. 
We deduce that, in fact, all of the blossoms from fi through f i+ i must agree on 
B. Furthermore, this conclusion still holds even if we allow some or all of the / 
simple knots t t+1 through F»+j to coalesce into knots of higher multiplicity. For 
example, if t t - +1 = i i+2 is a double knot, then there is no segment Fi+i and no 
associated blossom But the adjacent blossoms fi and /<+ 2 must still agree on 

the argument bag B because it includes two copies of the double knot i i+x = f t+2 . 
This sort of reasoning demonstrates the following. 

Proposition 14,2. Let the nonempty interval (t»-,t»+i) be the domain of one seg- 
ment of a spline curve F with knot sequence {ii}, and let the nonempty interval 
b e the domain of a later segment. The blossoms fi and /y of these two 
segments must agree on any argument bag that includes all of the intervening knots 
as often as their multiplicity, that is, on any bag that includes {£+1, . . . Jj) as a 
sub-bag. All of the blossoms of intermediate segments, if any, will also agree with 
fi and fj on such argument bags. □ 

To take full advantage of this proposition, let us extend our notation by allowing 
ourselves to write sets of indices as well as single indices as subscripts on /. If we 
know that all of the blossoms /» for t in the nonempty set S must return the same 
value on the particular argument bag {u u .. . ,u n }, we shall denote that common 
value by the term /s(«i,. . . ,u n ). In particular, a common value of / t and /y 
as guaranteed by Prop. 14.2 would be denoted / {i( y } (f,+i, . . . ,iy, uy_ t+1 , . . . ,u«). 
The rules for manipulating with these new f s terms are pretty straightforward. For 
example, if we know the points f s (x, u 2 , . . . , u n ) and f T (y , u 3 , . . . , u n ) and if the sets 
5 and T are not disjoint, we can linearly interpolate (or extrapolate, as necessary) 
to compute the point fsnriz, u 2 , . . . , u n ) for any z. 

One minor problem with the f s notation is that our indexing scheme for the 
segments of a spline is rather complex. The curve segment Fi is the one that the 
spline curve follows from the knot U until the knot f t+1 , if these two knots are 
distinct. If t,- = there is no segment F iy and hence we really shouldn't include t 
in the set S of an fs term. To avoid having to worry about which indices are valid, 
we shall extend our notation once again by allowing ourselves to write //, where I is 
an interval in the domain space L. In particular, we define the term //(tii, . . . , u n ) 
to denote f s {ui> . . . , u n ) where S := {t | n J ^ 0}. In order to write down 

a term of the form fi(u x , . . . , u n ), we must be able to show that the various fi for 
1 in this set 5 will agree on the argument bag {u x ,. . . , u n }, of course. In a term of 
the form fi(u u . . . , u n ), we shall refer to I as the validity interval. 

The result of Prop. 14.2 looks particularly simple when written in terms of 
validity intervals. It tells us that the term /(f<,f y+1 )(t.+i, . - . ,*y,tiy-<+i, . . . , u n ) is 
well-defined whenever its validity interval is nonempty, that is, whenever ti < fy+i. 
This result sets the stage for the multiaffine view of the de Boor theory of spline 
curves, which we tackle next. 
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15. The blossoms of spline curves 

What are the cases in which Prop. 14.2 is pushed as far as it can go? That is, 
what are the argument bags on which the widest range of segment blossoms must 
agree? The extreme case occurs when j — i + n, in the terms 

Note that there are no free arguments uy left in this term. What we have is a single 
argument bag on which n + 1 adjacent blossoms are known to agree. Let's call 
the agreed-upon value a pole of the spline F, since, as we will see shortly, it is a 
pole in the sense of the de Casteljau technique of Section 6 for the n+1 segments 
of the spline F that do agree. (If there are multiple knots involved, some of the 
n + 1 blossoms that agree on a pole may have degenerated into nonexistence.) An 
amazingly simple thing happens at this point in the theory of spline curves — be 
warned that this doesn't work for spline surfaces: the poles form a basis for the 
space of all spline curves with {U} as their knot sequence. That is, not only can we 
compute the poles from the spline curve; we can also go backward and compute a 
unique corresponding spline curve from arbitrarily specified values for the poles. 

Proposition 15.1. Let n be a nonnegative integer and let {?,} be a knot sequence 
in the affine line L that doesn't include any knots of multiplicity greater than n + 1. 
Then, spline curves F:L — ► Q of degree n with the knot sequence {U} are in one- 
to-one correspondence with sequences of points {x»} in Q by means of the formula 

Proof Given a spline curve F:L — ► Q with the knot sequence Prop. 14.2 

shows that the poles x» are well-defined, as long as each validity interval (^,F»+ n +i) 
is nonempty. This validity interval must be nonempty because otherwise the value 
U would be a knot of multiplicity at least n + 2. 

The reverse direction is harder. Let {x^} be an arbitrary sequence of points in 
Qy and let (tfc,ifc+i) be a nonempty interval in the knot sequence. Our first task 
is to construct the curve segment Ft that the spline will follow during the interval 
(*fcj*Jb+i)- To do so, consider the 2n knots centered around that interval, that is, 
the knots in the sequence (tfc- n +i, • • >*Jfe+n). We want to build a de Casteljau 
frame based on this reference sequence. To do so, we must verify the nondegeneracy 
conditions fy ^ f n +^ for 1 < t < j < n, as specified in Lemma 6.1, where u = 
Ffc_ n +i. In this case, that is easy to do: the first n points in the reference sequence 
are all at most while the last n points are all at least tfc+i, and we have ik < ffc+i- 
Therefore, by the de Casteljau technique of Cor. 6.3, we deduce that we can specify 
a unique n-ic curve Fk(u) by arbitrarily specifying its poles, that is, the blossom 
values /fc(**-n+i,...,*~fc)> fk(h-n+2,.. ,h+i)> -Jfc+n). We choose 

to specify Fk(u) by the conditions 



/ib(ft+i,...,*i+n) := x < fort in [fc-n,fc]. 



(15.2) 
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Suppose that we have used this technique to determine Fk(u) for each k with 
h < ffc+i- As a consequence, for any fixed t, it will be the case that x^ = 
fk . .,f»+ n ) for all k in [i,t + n] that satisfy ik < ifc+i- That same collec- 
tion of indices k can also be described as those for which n (F^+n+i) is 
nonempty. Hence, by the definition of the // notation, we will have 

There is one thing left to verify: we must show that the resulting polynomial 
curves Fk really do fit together to form a spline with the appropriate continuity. 
Suppose that ii+i = • • • = F»+ m is a knot of multiplicity exactly m, so that i t < 
and *»+ m < *i +m+1 . We want to show that the curve Fi joins the curve Fi+ m at 
that knot with C n_m continuity. If m = n + 1, which is the largest multiplicity that 
the hypotheses allow, we interpret C" 1 continuity to mean that there is no relation 
between Fi and i^+ m , so there is nothing to prove. Therefore, let us assume that 
m < n, and let / denote the level of continuity I ~ n — m. By Prop. 8.8, showing C l 
continuity at the joint is equivalent to showing that the blossoms fi and /» +m agree 
on all argument bags that include at least m copies of the knot, which, in symbols, 
is the identity 

/»(f*-*-lj.. ,f»+m,tii,...,U|) = /»+m(ft+l,...,ft+m,«l, 

We know something about the blossoms fi and /,+ m already, from Eq. (15.2). 
In particular, we know that . . . , F J+n ) = xy for j in [t - n,t], and we know 

that /t+m(*y+i, . . . jjj+n) = for j in [i - /,t + m]. Putting these facts together, 
we deduce that fi(t j+u . . . ,F y+w ) = /• +m (fy + i,...,f i+n ) for j in [t - /,t]. More 
pictorially, we know that fi and fi+ m agree on the rows of the parallelogram 

+ ! *»_l+2 ... tj t i+1 . . . f t+m 

+ 2 ... ti ti+i ... ti+ m i^+m+i 

ti tj+i ... ti+ m tj+m+X ... t»+ m -|_i_i 

We can finish off the proof by cutting the middle m columns out of this parallelo- 
gram, collapsing the two remaining triangles to form a smaller parallelogram, and 
then applying Cor. 6.3 once again, as follows. 

Let g denote the symmetric, multiaffine function of / arguments that results 
from fixing m of the arguments of fi at the value of the knot, that is, 

g(iii 7 . . . , u t ) ~ /»(?<+!, . . . , ii+ m , t2i, . . . , Uj), 

and let h be the corresponding restriction of /t+ m - The fact that fi and /»+ m agree 
on the rows of the parallelogram above implies that g and h have the same poles 
with respect to the reference sequence . . . ,?»,i t -+ m +i, . . . ,t<+ m+ /), which 

consists of the / knots that precede concatenated with the / knots that follow 
U+m- Furthermore, this reference sequence satisfies the nondegeneracy condition of 
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Fig. 15.4. A cubic spline curve G with knot sequence (0,0,0,0,1,2,3,4,6,6,6,6) 

Lemma 6.1 because each left-half element is at most f,, each right-half element is at 
least £"»+ m +i, and we have U < tt+i = U+m < ?<+ TO +i- Thus, we may conclude by 
Cor. 6.3 that g and h are identical. This implies that the blossoms /» and /«+ m agree 
on all argument bags that include m copies of the knot = U+m- By Prop. 8.8, 
we conclude that Fi and Fi+ m do join with C l continuity at that knot. □ 

Our next task is to demonstrate that the spline-controlling technique based on 
poles in Prop. 15.1 is just the standard de Boor technique in a new guise. One 
warning: many authors use the variable n to denote the order of a spline curve, 
while we have been using (and will continue to use) n to denote the bound on the 
degree instead. For example, a cubic spline has degree 3 (or less), but has order 4. 
For our purposes, the degree is a more important concept than the order, since the 
degree counts the number of arguments of the blossom. 

Proposition 15.3. The poles x t = /(f< ( F,. +n+l )(£*+i,. . . >**+n) of an n-ic spline 
curve F are precisely the de Boor points by which that spline curve is controlled in 
the standard theory Furthermore, the validity interval (ti,F»+ n +i) of the pole x» is 
precisely the region of the parameter space L over which x» inSuences the value of 
the spline curve; that is, the pole x» influences the spline segments Fk{u) precisely 
for k in [i,t + n]. 

Proof We shall prove this by writing down the multiaffine labels for all of the 
points that occur in the de Boor Algorithm. If you are reading for concepts rather 
than details, you might want to skip this proof. But you should take a moment to 
study the labels in Fig. 15.4, which shows an example of the de Boor Algorithm 
computing the point G(2.6) on a cubic spline curve G, whose knot sequence is 
(0,0,0,0,1,2,3,3,8,8,6,6). 

The de Boor Algorithm [11] defines a spline F(u) of degree n on the knot 
sequence {U} given a sequence of de Boor points {y,} by setting F(u) := o£ +1 \u) 
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where k is chosen so that tk < u < ik+i and the functions aj ,+1 '(u) are defined by 



(Yi, ifj = 0 

(t i+n . j+1 - iQq&M + (« - t,.)«['l(u) 

, ifj>0. 



The correspondence with the multiaffine point of view is given by the identity 

For j = 0, de Boor points y» = a^u) themselves are the same as the poles 

For j > 0, the formula that computes a[ J+1 ' (u) by interpolating between a[ J '(u) 
and Qf^i^u) corresponds to computing 



/(®,f.. + „_ y+l) ^+ i "-*«+»-i iiy ) 

by interpolating between 



and 

The latter two points share all but one argument; the varying argument is ti+^j+i 
in the former case and f,- in the latter case. We can turn this varying argument into a 
jih copy of ii by interpolating with the ratio ii : u : £,+ n _y +1 , just as the j > 0 case 
of the <*' J+1 '(u) recurrence tells us to do. The result of this interpolation receives, 
as validity interval, the intersection of the validity intervals of the two input points, 
just as it should. 

For j = n, we get a! n+1, (u) = / (f . |f . +l) (u, . . . , u) = F<(fi) for all i. If we then 
choose k so that u lies in (h>h+i), we have a£* l \u) = F k (&) = ^(u). 

To verify the claim about the validity interval of the pole x», we can prove 
by induction on j that the point x» = /(f,,f <+n+1 )(f»+i, . . . ,f*+ n ) aflFects the values 

a L J+11 (°) precisely for in [•',« + j]. In particular, x< aflFects 2?fc(u) = a{* +1, (ti) 
precisely for A: in [t, t + n]. This means that the spline F(u) is affected precisely for 
u in the validity interval (f t *,f<+ n +i). □ 

Project 15.5. Pick a computer scientist at your table who doesn't know anything 
about splines. Explain the de Boor technique for specifying polynomial spline curves 
to that lucky person, drawing all your figures on one paper napkin. Don't limit your 
presentation to cubics, and don't assume uniform knot spacing. 
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16. Overloading the notation for a spline blossom 

When working with the blossom / of a spline curve F, it is irritating to have to 
keep writing the validity intervals as subscripts on / all the time. Indeed, the labels 
in Figs. 1.3 and 1.4 were spline blossom values without validity intervals. The way to 
avoid writing the subscripts is to invent a scheme for deducing the proper subscript 
from the points in the bag of arguments. Note that the notation for the spline curve 
F itself involves just such a deduction scheme. To interpret the term F(u), we first 
deduce the proper t by determining which domain interval (t»,£»+i) contains the 
argument u; then we evaluate Fi(u). We shall refer to the adoption of a scheme for 
deducing the proper subscript as overloading. The non-overloaded notation Fi(u) 
is more powerful, since it allows us to evaluate any segment of the spline at any 
point, even points where u < ii or u > t^+i. But the overloaded notation F(u) is 
more convenient. Achieving the same convenience by overloading the notation for 
the spline blossom / is rather delicate, because there are n arguments instead of 
just one. 

Starting at the beginning, note that we can't possibly make overloading work 
when n = 0, since the blossom / doesn't have any arguments to examine. Indeed, in 
the case n = 0, the validity intervals convey just the right information. A de Boor 
point of a zeroth degree spline curve (a "constant spline") F(u) has a label of the 
form /(f.^ +l) (), and we have F(u) = / ( f <( f <+1 )() whenever u is in {iiji+i). 

When n = 1, each affine segment F{(u) of the spline is equal to its blossom 
/»(u), so we can overload the spline blossom in the same way that we overload the 
spline itself. The only problems arise at knots of multiplicity at least 2. 

When n > 2, there are two reasonable conventions that one could adopt, which 
we shall refer to as tame and wild overloading. In each case, the heart of the conven- 
tion is a rule for deciding, given a bag of arguments {uy}, which parameter intervals 
(t«)tt+i) should be candidates for use when assigning a value to the blossom expres- 
sion /(ui, . . . , u n ). Given a rule for candidacy, a blossom expression /(Oi, . . . , u n ) 
is well-defined if and only if there is at least one candidate interval and all of the 
values /(£ <1 £< +l )(ui,...,u n ) for candidate intervals agree. 

In tame overloading, the interval is a candidate for use when defining 

/(ui,...,u n ) whenever the closed intervals and [min{uy},max{uy}] inter- 

sect. For example, if the knot sequence {U} is the integers i» := t , there are three 
candidate intervals for the term /(0.5,2) under tame overloading: (0,1), (1,2), and 
(2, 3). Appealing to Prop. 14.2, we can deduce that the expression /(ui, . . . , ti n ) is 
well-defined under tame overloading if and only if every knot in the closed interval 
[min{tiy},inax{uy}] is included in the bag {uy} at least as often as it is a knot. In 
the example above, the term /(0.5,2) is not well-defined, because the simple knot 
1 doesn't appear as an argument; but the term /(0.5, 1, 2) would be well-defined. 

In wild overloading, there are two cases. If the blossom arguments uy are not all 
equal, then the interval (t»,f»+i) is a candidate for defining the term /(«i, . . . , tin) 
only when the open intervals (f 4 -,fi+i) and (min{tiy},max{uy}) intersect. If all of 
the uy are equal, the interval (t% f ii+i) is a candidate whenever the common value 
of the uy's lies in With the integers as the knots once again, only the two 

intervals (6,1) and (1,2) are candidates for use when assigning a value to the term 
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/(0.5,2) under wild overloading. Both of these intervals would also be candidates 
in the case of /(I, 1), but the interval (1, 2) would be the only candidate for /(l, 2). 
Appealing once again to Prop. 14.2, we deduce that the expression . . ,tt n ) is 

well-defined under wild overloading if and only if either min{tiy} < max{tiy} and 
every knot in the interval (min{uy},max{uy}) is included among the fiy at least as 
often as it is a knot; or min{uy} = max{uy} and the multiplicity of that common 
value as a knot does not exceed n. For example, if we adopt the knot sequence 
{ft} where t+ := [t"/2j, in which each integer is a double knot, then / (515,1,1,2) is 
well-defined under wild overloading, even though it would not be well-defined under 
tame overloading. 

The power of wild overloading is quite helpful, as we can see by comparing the 
labels in Fig. 1.4 to those in Fig. 15.4 (remembering that the labels in Fig. 1.4 are 
missing the bars over their arguments). Note that the label 0(0,1,1) in Fig. 1.4 is 
not well-defined under tame overloading, since 0 appears only once as an argument, 
while 0 appears four times as a knot. Indeed, the presence of one 0 among the 
arguments of g tells us that 0(0,1,1) should lie in the osculating plane Osc2G(0). 
But G(u) is not even C° at u = 0; hence, it has two different osculating planes there, 
one associated with small positive u and the other with small negative u. The wild 
overloading convention makes 0(0,1,1) well-defined by choosing the positive plane, 
on the grounds that the other argument values are all nonnegative and at least 
one of them is positive. More generally, the term /®(f/*~ J f/ +1 ) is a well-defined 
reference to the jth Bezier point of the n-ic segment -F([f»,^+i]) in almost every 
case, under wild overloading. The lone exception is the term when f, is a 

knot of multiplicity exceeding n. The label 0(0,0,0) in Fig. 1.4 is an example of 
this exception; it isn't well-defined even under wild overloading, just as G(0) isn't 
well-defined. 

But the power of wild overloading can lead to confusion in some cases. Consider 
a quadratic spline curve F(u) with knot sequence (...,0,1,2,2,3,4,...). The term 
/(1,2) is well-defined under wild overloading, with the value /(o,a)(l, 2). Similarly, 
the term /(2,3) is well-defined, with the value /( 5 ^(2,3). Furthermore, the terms 
/(1,2) and /(2,3) have all but one argument in common. We might be tempted to 
compute /(x,2) for 1 < z < 3 by interpolating between /(1,2) and (2,3). But the 
validity intervals of /(1, 2) and /(2,3) are disjoint; hence we have no right to do so. 
Such apparent failures of multiaffineness can't happen under tame overloading. In 
particular, suppose that /(x, u 2 , . . . , u«) and /(y, u 2 , • - - , u n ) are both well-defined 
under tame overloading. Then, for any z between x and y, the term f(z, u 2 , . . . , u n ) 
will also be well-defined under tame overloading, and the latter point can be found 
by interpolating between the two former points. 

17. The trellis of a spline curve 

In this section, we shall pause to reap the rewards of our labors by looking 
at some pictures of spline curves. Let the latter part of the Roman alphabet from 
p through z denote a portion of the knot sequence of a parametrically continuous 
spline curve F of degree n. The figures below will be drawn as if the knots were 
equally spaced, since that makes it easier to detect visual patterns; but we will refer 
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Fig. 17.1. An affine trellis 



Fig. 17.2. A quadratic trellis 



to the knots as u, v, and the like, rather than 2, 3, and the like, in order to emphasize 
that arbitrary knot spacings are possible. 

For the first batch of figures, suppose that we are particularly interested in the 
behavior of the curve F over the two adjacent intervals (t,u) and (u,v). Figs. 17.1 
through 17.6 depict the neighborhood of those two segments of F for degrees n = 1 
through n = 6. In each case, the points are labeled with a string of letters, with the 
special-purpose convention that, for example, Huvv" means /(*,,D)(t> u> *0- That 
is, we leave off the symbol /, the parentheses, the commas, and the bars, and we 
deduce the validity intervals by the use of an overloading convention — the tame 
convention will suffice if we assume that there are no multiple knots. 

In each case, the points whose labels consist entirely of t's and u's are the Bezier 
points of the segment ^(f |0 ) ([*>**]), those whose labels consist entirely of u's and 
v's are the Bezier points of ^(a,c)([ fi )^])- If « is a simple knot, as is the case in the 
figures, we have C"*" 1 continuity at 0. The bold lines in each diagram show the 
de Casteljau Diagram of order n — 1 that geometrically demonstrates this level of 
continuity. Note that the points of this de Casteljau Diagram are precisely those 
whose labels are drawn from the set {t,u, v}. Each of them contains at least one u, 
reflecting the fact that this diagram lies in the osculating flat Osc n _i F(u). 

The diagram in Fig. 17.3 for a cubic spline is well-known, although not with 
these labels. It is important to realize that this type of diagram is associated with 
every cubic spline curve, whether we used the de Boor theory to draw that spline or 
not. For example, consider the classical case of C 2 interpolating cubic spline curves. 
In this scheme, the designer chooses the locations of the joints /(u, u, u), /(v, v, v), 
and the like, while the knots u, v, and the like are chosen by some other rule — 
perhaps equally spaced or with spacing proportional to the chord length between 
the corresponding joints. Given this data, we must solve a system of linear equations 
over the entire spline curve in order to compute the geometry shown in Fig. 17.3. 
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Fig. 17.3. A cubic trellis 

(We must also choose some end conditions, in order to give us as many equations 
as unknowns in this linear system.) In a scheme based on cubic B- splines, the 
designer chooses the de Boor points /(«,f, u), /(£, u, t>), and the like, and the rest 
of the geometry shown in Fig. 17.3 is computed locally from them. But the same 
geometry exists in both cases. The only difference between the two schemes is which 
points and parameters are chosen as the input from which everything else must be 
calculated. 

Two interesting new phenomena arise in the quartic case, shown in Fig. 17.4. 
First, this is lowest degree for which the de Boor points of the curve are not part of 
the de Casteljau Diagrams that enforce continuity at the joints. As the degree goes 
up, the de Casteljau Diagrams move in more and more, getting further and further 
away from the de Boor points and the control polygon that connects them. The 
second interesting thing that happens when n = 4 is exemplified by the point uuvv: 
it is the first point we have seen that lies at an intersection that looks like an X 
rather than a Y. This means that there are two different ways to compute the point 
uuvv by linear interpolation from other points: we can interpolate between tuuv and 
uuvvj or between tuvv and uvvw. It is not immediately obvious, geometrically, why 
these two interpolations always give the same answer; but the blossoming technology 
makes it clear that they do. 

There wasn't enough space to label all of the points in the diagram for n = 5 in 
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uvwx 




rstu 

Fig. 17.4. A quartic trellis 

Fig. 17.5. The new phenomenon that arises here is pairs of line segments that cross 
in the digram without a dot at the crossing: for example, a little above the middle, 
the pair of lines tuvv* and tuuu*. Such crossings only arise because our figures are 
degenerate, by virtue of being drawn in a plane. If the de Boor points in Fig. 17.5 
were in general position, such pairs of lines would be skew. 

Fig. 17.6 shows the case n = 6. The pictures become harder to interpret 
around n = 6 because of the number of near-coincidences that start to arise. For 
example, just above the Bezier point uuuvvv are two points quite close together. 
Of that pair, the one on the de Casteljau Diagram is tuvvvv, while the other one is 
uuuuvw. The major new structural phenomenon that occurs for n = 6 takes place 
at the midpoint of the outermost shell of the bold de Casteljau diagram, at the point 
ttuuvv. This point can be computed by interpolating between other points in three 
different ways, corresponding to the three line segments in the interior of which it 
lies: [sttuuv.ttuuvw]^ [sttuvv y ttuvvw], and [stuuWytuuvvw]. 
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Fig. 17.5. A quintic trellis 
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Fig. 17.6. A sextic trellis 
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,/(0,2) = /(4,8) 

/(0,0) = /(4,4) = / ( M )X^ x «*, 2 ) = /( M ) 
m l) = f(7 ' 8)0 /(MT^f^* 2 ' 3) = /(6 ' ?) 

Fig. 17.7. A quadratic spline curve that is additively periodic 

As these figures have shown, every spline curve has a framework of line segments 
surrounding it and supporting it. Since this framework of sticks is covered with 
blossoms, it is overwhelmingly tempting to refer to it as the trellis of the spline curve. 
More technically, the vertices of the trellis are those blossom values /(ui, . . . , u n ) 
that are well-defined under wild overloading and all of whose arguments u< are knots. 
The line segments of the trellis consist of all blossom values that are well-defined 
under wild overloading and all but one of whose arguments are knots. The algebraic 
properties of the blossom provide a helpful guide to the geometric intricacies of the 
trellis.* 

One difficulty in drawing the trellis of a spline curve is knowing when to stop. In 
Figs. 17.1 through 17.6, we drew all of the lines involved in determining the behavior 
of the spline over the two adjacent segments F([i,u]) and ^([u,*;]). We can avoid 
the problem of knowing when to stop by considering a periodic spline, where we can 
draw everything with a finite amount of work. There are two types of periodicity 
th at a sp line can possess: additive and multiplicative. In the additive case, we have 
F(u + C) = F(u) and ii+\f = ii + C for some positive real constant C and positive 
integer Af . Fig. 17.7 shows an example of a additively periodic quadratic spline in 
which C = 4, M — 3, and the knot sequence has the form (. . . , 0, 2, 3, 4, 6, 7, 8, . . .). 
Each de Casteljau Diagram around the cycle of a periodic spline has an associated 
ratio. In Fig. 17.7, those ratios are 1, 2, and 1/2. In order for additive periodicity to 
arise, the product of these ratios around the cycle must be precisely 1. There is no 
reason why this product should take any particular value, however. If the product is 
not 1, the result is a spline that is multiplicatively periodic: we have F(Cu) = F(u) 
and U+M = Ci% for a positive real C and a positive integer M. Fig. 17.8 shows an 
example of a multiplicatively periodic quadratic spline in which C = 8, M = 3, and 
the knot sequence is given by U = 2*. 

Whatever flavor of periodicity a spline possesses, the periodicity allows us to 
draw the entire trellis of the spline in a finite diagram. Figs. 17.9 through 17.13 are 
examples constructed as follows: I copied a closed control polygon with 12 vertices 
from a paper by Robin Forrest [23]. I then constructed the additively periodic spline 



* Without the blossom as a guide, one might resort to drawing less than the full 
trellis; for example, Fig. 5 in Sablonniere's paper [37] is a subset of the quintic trellis 
shown in our Fig. 17.5. 
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/(1,2) = /(8,16) 

/ \f(i,i) 

Fig. 17.8 A quadratic spline curve that is multiplicatively periodic 

curves of degrees 2 through 6 with equally spaced knots determined by those 12 de 
Boor points. It isn't really relevant to this paper, but Fig. 17.14 shows what happens 
as the degree gets higher; it shows the splines of degree 2* based on the same control 
polygon for t in [0,8], All of the curves for degrees 512 and up are located within 
the small dot at the centroid of the 12 control vertices. Finite Fourier transforms 
are a good tool for studying the process of convergence to the centroid. 
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Fig. 17.9. The trellis of a periodic quadratic spline curve 




Fig. 17.10. The trellis of a periodic cubic spline curve 
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Fig. 17.11. The trellis of a periodic quartic spline curve 
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Fig. 17.12. The trellis of a periodic quintic spline curve 
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Fig. 17.13. The trellis of a periodic sextic spline curve 
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Fig. 17.14. The spline curves of degrees 2* for i in [0,9] 
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18. The blossoms of B-splines 

In the standard theory, a Bezier curve is the result of blending together its 
Bezier points using the Bernstein basis polynomials as the weights. Analogously, a 
spline curve is the result of blending together its de Boor points using B-splines as 
the weights. Our blossoming technology led us to the de Boor Algorithm, which is 
a way of computing a spline curve from its de Boor points. Hence, we already know 
about B-splines in a sense. But the connections between what we know so far and 
the standard explanations of B-splines are a bit subtle. In the standard theory [8], 
B-splines are defined either by a recurrence [10] or by a formula involving divided 
differences [9]. Spline curves are then defined as linear combinations of B-splines. 
Finally, it is shown that spline curves can be computed using the de Boor Algorithm 
[11]. In this section, we shall reverse that process. We shall define B-splines to be 
the weighting functions that result from the de Boor Algorithm. Then, we shall 
prove that those B-splines satisfy the recurrence and the divided-difference formula. 

Let L be the affine line, let {f»} be a knot sequence in L, let F:L — ► Q 
be an n-ic spline curve with as its knot sequence, and let {xy} where Xy = 
/(f y) Fy4. n+1 )(^+i)--,^+n) be the sequence of de Boor points of F. The de Boor 
Algorithm computes each point F(u) on the spline F as an affine combination of 
the de Boor points. We define the real-valued spline functions By (n+1 (u) to be the 
coefficients in this affine combination: 

F(u) = E S i.«+i(")/(fy 1 f J+ „ +1 )(*i+i.---.*y+«)- (18-1) 

3 

These coefficient functions By (n+ i(u) are called the normalized univariate B-splines 
of order n+1 for the knot sequence {FJ; the notation JVy (n+1 (u) is also common. 
The " + 1" in the second subscript arises because we are using n to denote the degree 
rather than the order. 

We can compute B-splines by considering spline curves F: L -> R, whose object 
space is the real numbers. The de Boor points Xy = /(f y ,r y+B+l )(ii+ 1, . . . Jj+n) of 
such a spline are themselves real numbers. From Eq. (18.1), we can see that the 
B-spline iJy.n+i : L — * R is simply the n-ic spline curve whose de Boor points x» in R 
are all equal to 0 except for xy, which is 1. Using the Kronecker delta notation, we 
can write this more compactly as x» = £ t y. Therefore, we can compute the B-spline 
B o,n+i by applying the de Boor Algorithm, starting with the de Boor points x t = 5 f *y. 

One source of complexity in that computation is overloading. Note that both 
the F(u) and the 5y in+1 (u) in Eq. (18.1) are splines, that is, piecewise polynomials. 
The pieces that make up F(u) are the polynomial curves F k (u) for each Jfc with 
(*Jfc>**+i) nonempty. In a similar way, let us define £y in + lt *(u) to be the n-ic 
polynomial that the B-spline B jtn+1 (u) follows for u in' (i k} ik+i)\ we shall call 
Bj t n+i,k a B-polynomiaL The non-overloaded version of Eq. (18.1) is 

= £ fl i>+l.fc(fl) f({,,l i+n+l )(tj+U ■ • ■ ,t,+n), (18.2) 

3 

which expresses the polynomial curve F k as a weighted sum of de Boor points, with 
the B-polynomials as the weights. In particular, the B-polynomial value B jtn + ltk (u) 
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gives the influence of the j th de Boor point on the value of the fcth spline segment at 
the time u. If we blossom both sides of Eq. (18.2), we can generalize this observation 
to bags of times {tii, . . . , u n } whose n components aren't equal. We have 

,tX n ) = 5^fci,n+l,ib(Sl» - ■•»«„) /(C yi «y +ffc+ i)(*i+l> • - ( 18 « 3 ) 
3 

where iy.n+i,* denotes the blossom of the B-polynomial £y,n+ 

Although the sums in Eqs. (18.2) and (18.3) are formally infinite, only n + 1 
of the terms are actually nonzero. From Prop. 15.3, we know that the jth de Boor 
point influences the Jfcth spline segment precisely for k in + n]. Turning this 
around, we deduce that the Arth segment is influenced by the jth de Boor point 
precisely for j in [A; - n, A:]. For j outside of [k - n, A;], the B-polynomial By.n+i.fcft*) 
is identically zero. (Throughout this section, the index j will always name a de 
Boor point and the index k will always name a spline segment. Any value of j is 
legitimate, but the only legal values of A: are those whose domain intervals (fjt,tjt+i) 
are nonempty.) 

There is an interesting way to think about the evaluation of the segments Fk(u) 
of a spline curve. For each A: with (Ffc, tfc+i) nonempty, we can compute the blossom 
value /*(ui, . . . , u n ) by using the Generalized de Casteljau Algorithm starting with 
the n + 1 poles of which are precisely the de Boor points that influence that 
is, the points . . - ,*y+ n ) = x j for 3 in [k - n,k]. We found in Section 7 

that the resulting computation scheme is triangular, shown for the case n = 3 
as the downward-pointing triangle in Fig. 7.4. Something interesting happens if 
we consider fixing the blossom arguments {ui, . . . ,u n }, but varying A:. Of course, 
this is a pretty weird thing to do. Normally, we are interested in the value of 
• • • j w n ) only when the points u, are in the interval (tjb,tfc+i)> or at least close 
to that interval. Indeed, if we hadn't removed the overloading first, there wouldn't 
be any k around to vary: its value would be deduced by some overloading convention 
from the arguments u». But weird ideas sometimes pay off. What happens in this 
case is that the triangular computation schemes for different values of A; overlap in 
large party and can hence be assembled to form one long, diamond-celled lattice, 
reminiscent of the old-fashioned, unsafe type of baby's gate. Fig. 18.4 shows a chunk 
of this lattice in the case n = 3. The de Boor points xy of the spline F form the 
top row of the lattice. Each subsequent row is computed from the row above by 
taking affine combinations controlled by one of the arguments u», as specified in 
the de Boor Algorithm. The bottom row gives the values := /jt(«i> - . . , tin) that 
result from evaluating the blossom f k of each segment F k of the spline F on the 
fixed argument bag {ui, . . . , u n }* 

If there are multiple knots, this lattice will have triangular notches cut out of the 
bottom of it, whose heights correspond to knot multiplicities. There are two ways 
to see this. The naive way is to note that multiple knots would cause some of the 
edge-labels in the full lattice of Fig. 18.4 to specify division by zero; those edges and 
the points underneath them must be deleted. More formally, if = • • • = *fc+ m 
is a knot of multiplicity m > 2, we have a computational triangle for fk and one 
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for fk+my but the intervening index values correspond to empty domain intervals, 
for which there are no corresponding functions or computational triangles. It is the 
absence of the triangles for /fc+i through fk+m-i that generates a triangular hole 
of height m at the bottom of the lattice. 

The nodes in the lattice in Fig. 18.4 are labeled to reflect a computation of 
the blossom values fk(&i, . . . , u n ) for the arbitrary spline F. Fig. 18.5 shows the 
same lattice, but with the node labels omitted. We can use this lattice to compute 
R-polynomial values &y )n +i ( fc(ui, . • • ? u n ) in three different ways. 

The first way uses the lattice top-down, the way that we developed it, exploiting 
the observation that B-splines are just real-valued spline curves whose de Boor 
points are all 0 except for a single 1. Choosing a fixed j, we set the top-row nodes 
x^ by the rule x t := Sij and then we compute successive rows, as in the de Boor 
Algorithm. The output of this computation, on the bottom row, will be the values 
yjfe = 6 J(n+ i ( jfe(tii, . . . ,u n ) for all k with ik < ffc+i- Intuitively, we have computed 
the influence of our chosen jth de Boor point on all segments of the spline at the 
fixed bag of times {ui, . . . , u n }. 

If we label each node in the lattice as we do this top-down computation, what 
interpretation can we give to the labels? For each node, consider the monotonic 
paths in the lattice that reach that node, starting from the chosen top-row node xy. 
The label on each node will be precisely the sum, over all such paths, of the product 
of the edge-weights along that path. In particular, suppose that we also choose a 
fixed node yjt on the bottom row, that is, a fixed segment of the spline. There will 
be precisely ( fc 2y) monotonic paths in the lattice from the chosen top-row node Xy 
to the chosen bottom-row node y*. The blossom value bj in +i t k{&i, . . . , u n ) is just 
the sum, over all such paths, of the products of the edge-weights along that path. 
This constitutes a second way of using the lattice of Fig. 18.5 to compute values of 
the B-polynomials. 

The symmetry of that second approach between top and bottom suggests that 
we could also use the lattice yet a third way: bottom-up. Choose a fixed &, with 
ik < Ffc+i. Set the bottom-row nodes y» in the lattice by the rule y^ :— 8ik- Then, 
compute values for nodes in successively higher rows, taking linear combinations 
as specified by the edge weights. The resulting labels on the interior nodes of 
the lattice will be quite different, under this bottom-up approach, than they were 
under the top-down approach. Note, in particular, that the two weights involved 
in one linear combination on the way down the lattice always sum to 1, making 
that linear combination an affine combination. But the two weights involved in a 
linear combination on the way up the lattice generally don't sum to 1. What labels 
will this bottom-up process give to the nodes in the top row? Since the notion of 
weighted paths is symmetric, we deduce that the label on the jth top-row node will 
be Xy = 6y ( n+i,fc(tii> • • • >u n ). Intuitively, the bottom-up approach chooses a spline 
segment and computes how that segment is influenced by all de Boor points, while 
the top-down approach chooses a de Boor point and computes how that de Boor 
point influences all segments. 

The bottom-up way of evaluating the lattice in Fig. 18.5 corresponds to one of 
the two standard definitions for B-splines, the one based on a recurrence relation. 
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The blossomed form of that recurrence relation is 

bj.n+lAui, • ■ • > An) := * ft J>,fc(ttl • • • , «n-l) + 

; kj+x^Uu . . . , U n _i). 

If we set all of the U{ equal to a common value u and then use overloading to infer 
the proper value of k from the value of u, we arrive at the standard version of the 
B-spline recurrence [10]. 

Proposition 18.6. The normalized univariate B-splines i?y, n +i f° r the knot se- 
quence {£,}, which we defined in Eq. (IS.lJ as the weights with which the de Boor 
points are blended together to form an n-ic spline curve, are given by the recurrence 

^ i(ti) := I 1 tfoe(f,-,fy+i) 

J ' v ' l0 otherwise 

*i..+x(«) ~ r^V^.nW + / J+n+1 " U B J+ X.n(")- □ 

*y+n - ty £ y+n+i - ty+i 

The bottom-up approach has an interesting feature: given a fixed knot sequence 
{?,}, we can start off the bottom-up approach without knowing what the degree n 
is. We first compute the linear B-splines based on the knot sequence {U} } then the 
quadratic B-splines based on that knot sequence, and so on; we can stop whenever 
we like. By contrast, we have to know the degree at the outset in order to use the 
top-down approach. 

Proposition 18.7. The normalized univariate B-splines S J>n+ i(u) for the knot 
sequence can also be computed by the divided-difference formula 

= (*y+n+i - *y)[*y> ■ - - ' - «)+• ( 18 - 8 ) 

Proof First, we have to interpret the formula. The expression ( • — ti)+ denotes that 
function of x whose value is (x — u) n when x > u and is zero otherwise, a so-called 
truncated power function. Using an inequality-based form of the Kronecker delta, 
we could also write that function as 

(£r>u)(s- 

The expression [tj t . . . ,ty+ n +i] that precedes the truncated power function is an 
instance of the divided- difference operator. If j:R — ► R is some function, the di- 
vided difference [tj> . . . 9 tj+ n +i]g(z) is the coefficient of x n+1 in the unique polyno- 
mial of degree n + 1 that interpolates the function g at the n + 2 sampling points 
{iy,...,ty+ n +i}. Divided differences have a beautiful theory, which Carl de Boor 
lucidly explains in his book on splines [9], We will hardly scratch the surface of that 
theory in our verification of Eq. (18.8). To begin that verification, we shall remove 
the overloading from Eq. (18.8) and then blossom the result. 
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The overloading in Eq. (18.8) comes from the fact that u is used both in the 
power function (z-u) n and also to control the location at which that power function 
gets truncated. To remove the overloading, let §k be some point in the interval 
('*>'jfe+i) for each k with < Then, we have 

*j>+i,*(fi) = (*i+n+i -*y)[*yi...,«y+n+i](*^ (*->.*)(*- «)")• (18.9) 

It doesn't matter which point §k in the interval (ffc,tjb+i) we choose as the threshold 
of truncation because the divided-difference operator only examines the values of its 
functional argument at the knots U. We can blossom both sides of Eq. (18.9) quite 
easily; switching the constant factor (ty+ n+ i - 1 3 ) over to the left-hand side to save 
space, we have 



&j>+l,fc(ui,..-,ttn) 



n 

[t jy . . . , ty +n+ i] (x ~ (5 x > Sk ) ]J(Z - «,)) . (18.10) 



Having generalized Eq. (18.8) to form Eq. (18.10), we now begin to specialize 
again. Instead of allowing an arbitrary argument bag {tii, . . . , u n }, we claim that 
it is enough to consider the particular argument bags Ei := . . . ,f t + n } for s 

in [A; - n, k\. The proof of this claim essentially repeats an argument that we used 
in the proof of Prop. 15.1. Note first that both sides of Eq. (18.10) are symmetric, 
multiaffine functions of the points u, ; for this, we need to know only that the divided- 
difference operator is linear. Fix any k with the property that < tfc+i, and 
consider the reference sequence (f*-n+i, . . . ,Fjb+ n ). From Lemma 6.1, we know that 
the n+ 1 simple n-tensors := U+i • • • U+n for i in [k- n, k] form an affine frame for 
the space L® n . We conclude that it is enough to verify Eq. (18.10) for the argument 
bags Ek-n through This leaves us with the chore of proving 

Wft + l,..^ + n) = Wl] ( x ^ fe .J JJ (x _ tt . +|) ) (lfUl) 

for all t, j, and k with < and t in [k - n y k]. 

Let's work on the left-hand side of Eq. (18.11) first. If we substitute the argu- 
ment bag Ei = {f»+i, . . . >ii+n} into Eq. (18.3), we get 

/*(«<+!,.. . ,**+*) = 6j,n+l, A (**+!> . - . > ti-fn) /(fy,f i+n+ i)('i+li • • • > *i+n). 

i 

Since k is in [t,i + n], the left-hand side of this equation is actually the tth de 
Boor point /(f,.^. +n+1 )(t»+i *»+»). Since the de Boor points are independent 
variables, we conclude that the coefficients on the right-hand side must satisfy 
ty.n+i.ifc^t+i, . Ji+n) = f>ij- Thus, the left-hand side of Eq. (18.11) is given by 
W('i+n+i - tj)- 

It remains to show that the right-hand side of Eq. (18.11) has the same value. 
Recall that, if g : R — ► R is some function, the divided difference [ty , . . . , f y+n+i ]g(x) 



108 



SPLINE CURVES 



18.13 



is the coefficient of x"" 1-1 in the unique polynomial Z(x) of degree n + 1 that inter- 
polates the function g at the n + 2 sampling points {tj» . . - ,*y+n+i}« (Repeated 
sampling points imply osculatory interpolation; that is, if some real number y oc- 
curs more than once as a sampling point, say m times for m > 1, we demand that the 
polynomial Z(x) osculate the function g(x) to (m - l)st order at y. See Ex. 18.12.) 
In Eq. (18.11), the function g(x) to be differenced is the truncated n-ic polynomial 
g(x) := {S x > 9k )h(x) where h(x) := n!Li( x " *<+*)■ 

All three indices i, j, and k can now be thought of as specifying the positions 
of significant events in the knot sequence: 

(t i+ i, . . . ,t»+ n ) : roots of h(x) 
{tj, . . - »ty +n +i) : sampling points 
. . . | . - . : truncation threshold 

If we sample ^(x) at a root of h(x), we will get zero, regardless of the truncation 
threshold. What happens if we sample g(x) at a knot that isn't a root? Call 
the knots t m for m < t the pre-roots and the knots t m for m > t + n the post- 
roots. Since k is in [t,i + n], the truncation definitely affects all of the pre-roots and 
definitely doesn't affect any of the post-roots. If j < t, all sampling points are either 
roots or pre-roots. Hence, all samplings return zero, the interpolating polynomial 
Z(x) is identically zero, and the coefficient of x n+1 in Z(x) is zero. If j > t, all 
sampling points are either roots or post-roots. Hence, all samplings return h(x)> the 
interpolating polynomial Z(x) is equal to h(x) y and the coefficient of x n+1 is again 
zero, since h(x) is an n-ic. The only chance for a nonzero value on the right-hand 
side of Eq. (18.11) is the case j = t. In this case, the interpolating polynomial 
Z(x) must have all the roots of h(x) as roots and also the last pre-root tj as a root. 
Therefore, up to a scalar multiple w, we have Z(x) = w(x - t 3 )h(x). To determine 
the scalar u;, we use the fact that Z and h must agree at the first post-root i J+n +i. 
This implies that w = l/(tj+n+i -tj)- since w is also the coefficient of x n+ 1 in Z(x), 
we conclude that the right-hand side of Eq. (18.11) has the value 5<y/(tj+ n +i " *i)> 
which agrees with the left-hand side. □ 

Exercise 18.12. Verify that the arguments in the last few paragraphs of the proof 
of Prop. 18.7 remain valid even in the presence of multiple knots. 

Remark 18.13. I made the bold assertion, back in Section 4, that vector-space 
duality would never arise in this paper. Well, hardly ever. The linear blossoms 
&® -Li fc * of the B-polynomials are actually n-covariant tensors on L. In fact, for 
each fixed fc, the n-covariant tensors {&® n+lifc( *} for j in [k - n,fc] form a basis for 
the dual space (L? n )* (sorry about the double use of "*"), which is the dual of the 
basis for the linear space Lf n formed by the n-contravariant tensors • ■ • U+n} 
for t in [k - n,ifc]. That is precisely what the equation 

says. Perhaps dualizing should be added as a fourth principle, to supplement blos- 
soming, tensoring, and homogenizing. 
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Fig. 19.1. A cubic spline curve with knot sequence (0,0,0,6,1,2,2,2,2) 




Fig. 19.2. The cubic trellis of the spline curve in Fig. 19.1 



19. Raising the degree of a spline curve 

Elaine Cohen, Tom Lyche, and Larry L. Schumaker recently published a paper 
giving algorithms for raising the degree of a spline curve [6]. As another demon- 
stration of blossoming technology, let's go over one example from their paper. We 
won't be proving anything new in this section. Our goal instead is to get some feel 
for the rather different perspective on such problems that blossoming provides. 

Fig. 19.1 shows a cubic spline curve F, analogous to the curve shown in Fig. 2 
of [6], with the knot sequence (0,0,0,0,1,2,2,5,2). The labels on the middle three 
of the five de Boor points are well-defined under wild overloading; we shall trust to 
context to disambiguate the first and last de Boor points /(0,0,0) and /(2,2,2). 
Fig. 19.2 shows the trellis of F y which includes the four Bezier points of each of its 
two cubic segments. 

The problem that Cohen, Lyche, and Schumaker pose is to view this curve F as 
a degenerate case of a quartic spline. Now, the joint between ifyj) and ^(i,5) has 
C 2 continuity. If we think of F as a cubic spline, this fact makes I a simple knot. 
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Fig. 19.4. The quartic trellis of the spline curve in Fig. 19.1 

But, if we think of F as a quartic spline, we must make 1 a double knot. In general, 
every knot has to increase in multiplicity by one when we raise the degree, since the 
level of continuity of the corresponding joint remains unchanged. If we let G denote 
F viewed as a quartic spline, the knot sequence of G is (0,0,0,0,0,1,1,3,2,5,5,5). 

In Eq. (10.1), we learned how the blossom g k of any particular segment of G 
could be computed from the corresponding blossom /*: 

_ . Afcl, + A(ui,U 2,t*4) + "3, "4) + fk{u2^ 3y U 4 ) 

0fc( u l» U *> fi 3,«4) = ~ 1 * 

(19.3) 

How does this equation interact with overloading? If ^(ui, u 2 , u 3 , u 4 ) is well-defined 
under either the tame or the wild convention, the four /-terms in Eq. (19.3) will 
also be well-defined under the same convention, since the multiplicity of every knot 
goes down by one as we move from G to F. Therefore, Eq. (19.3) also holds for the 
overloaded functions / and with the subscripts of k deleted. From this, we can 
easily compute the points of the trellis of G as affine combinations of trellis points 
of F. Fig. 19.4 shows the trellis of G in bold, superimposed on the trellis of F . 

Of the ten trellis points of G, the expressions that we get for nine of them are 
affine combinations of no more than two trellis points of F. In those nine cases, the 
G trellis point lies somewhere on the F trellis. For example, we have 

9(0 ,o,l,l) = 2/(Q.MH2/(0,I,I) = /(0iKiU) . 

The tenth case is more interesting: 

m T T 9\ /(0> 1, 1) + 2/(0, 1 ? 2) + /(!,!, 2) 

It is easy enough to reduce these three /-terms to two in various ways._ For ex- 
ample, Fig. 19.4 shows 0(0,1,1,2) as the midpoint of the line from /(0, 1,2) to 
/(I, 1,1). Alternatively, it could have shown g (0, 1, 1, 5) as the midpoint of the line 
from /(6,1,L5) to /(0l,I,2). 
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If we want a single /-term that names the tenth trellis point 0(0,1,1,2), we 
can apply Prop. 10.4. It tells us that such an /-term will exist, and that the three 
arguments of that /-term will be points in the intervals [0,1], [1,1], and [1,2]. 
Computing those points corresponds to factoring the 3-tensor 



01 2 + 2012 + 1 2 2 

e = . 



Using the basis {0,5} for £*, we have 

e = j ^ 0(0 + S) + 20(0 + 28) + (0 + S) (0 + 28) ^ 

^ 40 2 + 80g + 2S 2 ^ 



= 1 

20. Geometric continuity for spline curves 

As we mentioned back in Section 14, a joint between two polynomial curves is 
called geometrically continuous of order fc, or G fc continuous, if the joining curves 
can be reparameterized so as to meet with C k continuity [15]. Let 8 be the parameter 
value corresponding to the joint. A reparameterization is a smooth (that is, a C°°) 
function h from a neighborhood of 8 in the affine line L to another neighborhood of s 
with the property that h(s) = 8 . In order to guarantee that the reparameterization 
does not change the order of continuity of the joint, we restrict ourselves to regular 
reparameterizations, which are those with h $ {s) ^ 0. (The derivative h!{s) is a vector 
on L, of course, so the zero that we are comparing it with is the zero vector on L.) 
We also want the alternative notions of time that h relates to agree about what is the 
past and what is the future; that is, we want h to be orientation-preserving. Hence, 
we actually enforce the sharper constraint h' (s) > 0. Every regular, orientation- 
preserving reparameterization has a functional inverse, which is also regular and 
orientation-preserving. Therefore, instead of reparameterizing both the incoming 
and outgoing curves, it suffices to reparameterize one of them. We shall follow the 
majority convention of reparameterizing the incoming curve. Be warned that some 
sources adopt the opposite convention and reparameterize the outgoing curve [16], 
with the result that their reparameterizations are the functional inverses of ours. 

Let F: (r, §) — ► Q and G: (s, t ) -* Q be two n-ic polynomial curves, and suppose 
that we are interested in the joint between F and G at s. To test continuity, we 
compare the derivatives up through order A: of the reparameterized incoming curve 
F o h to those of the outgoing curve G. Suppose that the Taylor series of h has the 
form 

fc(g) = g + /?!(«-«)* + ft (U ~* )2 S + --- + Pk {U ~ f a)k S + --- , 

where 8 denotes the unit vector on L and # is the scalar by which 8 must be 
multiplied to give the tth derivative hW(s) ofh at s. Note that only the first k terms 
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in this series matter when testing G k continuity. For k = 0, the condition of G° 
continuity is the trivial relation F(s) = G(s), the same as C° continuity. For k = 1, 
we can use the Chain Rule of calculus to find that <3'(s) = (F o h)'(s) = PiF f (s). 
For k > 1, we can use Faa di Bruno's Formula [30], which is the higher-order analog 
of the Chain Rule, to find that 

<*"«- E E <ll( ,y.. M . W(M y. tf-^W- 

u+2* 3 +- 

*Xi*2» ",*fc>0 

De Rose calls these equations the univariate Beta Constraints [15]. Here are the 
first few of them: 

G(a) = F(S) (BCo) 

G'(s) = (J l F'(s) (BCt) 

G"(s) = rtF"(§) + l3 2 F'(s) (BC 2 ) 

gW(8) = fiFW{i) + WifaF"{3) + fi*F'(s). (BC 3 ) 

The curves F and G meet with G k continuity if and only if real constants Pi through 
P k exist with Pi > 0 that satisfy Constraints (BC 0 ) through (BC*). The numbers 
Pi are called shape parameters. 

The first shape parameter Pi is rather special. Consider the reparameterization 
h(u) := s + Px(u — s)S. Since h is an affine map, the reparameterized incoming 
curve F o h will also be an n-ic polynomial curve. Indeed, the behavior of F o h 
on the interval will correspond precisely to the behavior of F itself on 

the interval [r,f], the only difference being that the two domain intervals differ 
in length by a factor of P\. But the length of the parametric interval on which 
each spline segment is defined is already an arbitrary value in our framework, set 
by the designer's choice of the knot sequence. If we also let the designer specify 
an arbitrary value for we would end up with two handles on the same degree 
of freedom, which would be both pointless and confusing. To avoid this, we shall 
restrict ourselves to the case 0\ = 1. The other reasonable choice would be to allow 
Pi to assume different values at different joints but to restrict the placements of 
the knots in such a way that each segment of the resulting spline curve is defined 
over an interval of unit length. One problem with this alternative scheme is that 
it makes it harder to study the limiting process in which two simple knots coalesce 
to form a double knot. In the scheme that we are adopting, the knots can merge 
smoothly. Indeed, if two simple knots coalesce, each corresponding to a joint with 
G n ~ x continuity, the result is a double knot, corresponding to a joint with G n ~ 2 
continuity, whose associated reparameterization is the functional composition of the 
reparameterizations of the coalescing knots. In the alternative scheme where fi\ is 
allowed to vary but all intervals on the parameter line must have unit length, the /?i 
value of the joint entering the disappearing segment tends to zero while the Pi value 
of the joint leaving that segment goes to infinity in such a way that the product of 
these two 0\ values remains constant. 
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A 




Fig. 20.1. Two parabolic segments that join with G 2 continuity 

The shape parameters from /? 2 on up really do give us the ability to form new 
joints. As an example, consider the two parabolas shown in Fig. 20.1, the curves 
y = 3 — x 2 /4 and x = 3 — y 2 /4. These two parabolas both go through the point 
(2,2), and they both have slope -1 there. By symmetry, we deduce that they have 
the same curvature there as well. Suppose that we form a quadratic polynomial 
spline curve as follows. For u in (0, 1), we define F 0 (u) = (2u,3 - u 2 ), thus following 
the parabola y = 3 - x 2 /4 from the point (0, 3) to the point (2,2). For ti in (1, 2), we 
define F^u) = (4u - u 2 - 1,4 - 2u), thus following the parabola x = 3 - y 2 /4 from 
the point (2, 2) to the point (3,0). An easy computation shows that i^(l) = (2, -2) 
and Fg(I) = (0,-2), while F[{1) = (2,-2) and *\"(1) = (-2,0). Hence, from the 
perspective of parametric continuity, the spline curve F is C 1 but not C 2 at u = 1; 
that is to say, 1 is a simple knot. 

Since F has curvature continuity at 1, as we argued above by symmetry, we 
would expect F to be G 2 continuous there. To verify that this is the case, we 
substitute the derivatives above into the Beta Constraints. Constraint (BCq) holds 
and Constraint (BCi) holds for 0i = 1, which is the only legitimate value for £i 
according to our convention. We can make (BC 2 ) hold by choosing 0 2 = -1: 

F»(l) = (-2,0) = fi*Fg{l) + 0 t Fl{l) = (0,-2) - (2,-2). 

Thus, the spline F does have G 2 continuity at the point u = 1. 

Note that the joint between F 0 ([Q, I]) and ^([l, 5]) is surrounded by three lines 
that form a roman letter "A" much like a de Casteljau Diagram with 2 shells, except 
that the ratios of the lengths of the segments aren't in the de Casteljau pattern. This 
observation is valid in general. Starting from Eq. (8.3) and the Beta Constraints, it 
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Fig. 20.2. The A-frame surrounding a G 2 joint between cubic segments 

is straightforward algebra to check that the two Bezier n-ic segments F : (f , a) — ► Q 
and G: (a,i) —*> Q meet with G 2 continuity at a if and only if the last three Bezier 
points of F([r, a]) and the first three Bezier points of G([a, i\) are coplanar and they 
can be connected by lines to form an A-frame, as shown in Fig. 20.2, where the apex 
of the "A" is a point added in the construction. The distance ratios a and A are 
related to the knot values and the shape parameters by the formulas 

««-• a 1 1 ft f*-*v 

* = fix — and J-l- (n _ lW<r+1) \—r) ' 
In our special case of Pi = 1, these formulas simplify to 

a-r A n - 1 \a + 1/ 

The joint between F and G has C 2 continuity when 0% = 0, and hence A = 1. In 
this case, the joint is surrounded by an honest de Casteljau Diagram of order 2, in 
which all three line segments are divided in the ratio 1 : a. As we move A away from 
1, we get joints that are G 2 continuous but not C 2 continuous. The example joint 
between parabolas in Fig. 20.1 has A = 2. 

The name "A-frame" is most appropriate in the case where A is some positive 
constant. It is also possible to have G 2 joints in which A = oo or A < 0. In the 
case A = oo, the legs of the "A" are parallel and the apex of the "A" has moved 
off to the line at infinity; the name TL-frame seems appropriate for this case. When 
A < 0, the legs of the "A" point in towards the bottom, and we might call the result 
a V -frame. 

Exercise 20.3. Generalize the concept of an A-frame from A: = 2 to arbitrary k. 
More precisely, prove the following: for any A;, G k continuity at the joint between 
two n-ic curves is enforced by a diagram with the same points, lines, and incidence 
structure as a de Casteljau Diagram with k shells, but with different distance ratios. 
The distance ratios that arise for k > 3 are, unfortunately, rather complicated 
functions of the shape parameters. 
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Fig. 20.5. Attempts at labeling the vertices of an A-frame 

Challenge 20.4. Figure out a comprehensible general form for those distance ra- 
tios. Use this form to analyze the different ways that a G k joint can be "turned 
inside out," as the shape parameters are adjusted, by having portions of the enforc- 
ing diagram pass through each other or through the hyperplane at infinity. While 
you are at it, figure out good ways to control spline curves with G k joints for k > 3. 

What happens if we study the A-frame in Fig. 20.1 from a blossoming point of 
view? If / 0 and /i denote the blossoms of Fq and F\ 9 five of the six points in the 
A-frame are easy to label as blossom values, as is done in Fig. 20.5. Note that f 0 
and /i will agree on all argument bags that include a 1, since the curves F 0 and 
Fx join with C 1 continuity there. The big question is what to label the apex of the 
A-frame. 

Since / 0 is multiaffine, one possible label for the apex is /o(6, 3). Symmetrically, 
we could also label it /i(^l, 2). The difficulty with these labels is that it isn't obvious 
where the new arguments 3 and —1 have come from. Note that the proper values 
for these new arguments depend upon A, and hence upon the shape parameter 

From the point of view of understanding the structure of the spline curve it 
would be much nicer to label the apex of the A-frame /(0, 2), for some meaning of 
the function symbol /. Note that the curvature continuity of F at the joint implies 
that the tangent lines Osci ^(6) and Osci F(2) to F at the points u = 6 and u = 2 
are not skew, as they might be if the joint at u = I was only C 1 , but actually 
intersect. By the differential perspective on the meaning of a blossom's arguments, 
we would like to label that intersection point /(0, 2). If we use the same tangent-line 
rule to determine the value of /(u,t>) for all ti and v in [0,2], the resulting function 
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will be rather complicated. If we have both u < 1 and v < 1 or both u > 1 and 
v > 1, the value /(u, v) will behave in a biaffine manner as we vary u and v. But, if 
we have u < 1 < v, the value /(u,t;) responds to changes in u and v in a complex 
manner, which depends upon the shape parameter #2- 

Challenge 20.6. Find a simple and enlightening way to blossom a spline curve 
that has geometric continuity. If your technique gives the apex of the A-frame in 
Fig. 20.5 the label /(6,2), it should include an insightful study of the properties of 
a non-multiaffine blossom, such as the resulting /. 

Geometric continuity is more subtle than parametric continuity in many ways; 
here are three to think about. 

First, geometric joints have associated shape parameters. It is tempting to 
try to use those new degrees of freedom to get more smoothness or more flexibility 
out of spline curves of some given degree. Unfortunately, such schemes are fraught 
with peril because of the nonlinear nature of the Beta Constraints. In order to 
exploit the shape parameters to achieve some abstract goal, one would generally 
have to determine their values by solving nonlinear equations — in the worst case, 
a global system of nonlinear equations over the whole spline. The current success- 
ful schemes for drawing spline curves with geometric continuity skirt this peril in 
different ways. The various flavors of Beta-splines [1, 2, 26] avoid the nonlinearity 
entirely by treating the shape parameters as dials that the designer can turn to 
explore a wider universe of smooth shapes. Once the shape parameters have been 
set by the designer, the problem of nonlinearity disappears. Gerald Farin suggested 
a wonderfully simple technique for drawing G 2 cubic splines, in which the designer 
specifies the two middle Bezier points of each cubic segment [21]. With the notations 
of Fig. 20.2, Farin's insight was that the location of the joint w could be computed 
from the locations of the four coplanar points p, q, r, and s by the equation 

(^)'--(¥)©-(Hf)(^)- 

The problem of nonlinearity shows up in Farin's scheme as the restriction that the 
points p and s must lie on the same side of the crossbar, the line joining q to r; 
otherwise, the formula for a demands taking the square root of a negative number. 

A second interesting thing about geometric continuity is that it enables the 
existence of knots with multiplicity zero. A knot of multiplicity m in an n-ic spline 
is, in the geometric case, the parameter value of a joint that has <3 n ~ m continuity, 
with associated shape parameters 0 2 through £ n _ m . It is perfectly possible to have 
m = 0, that is, to have a G n joint in an n-ic spline. In fact, the quadratic spline in 
Fig. 20.5 is a case in point. It has 0 and 2 as triple knots, and it has 1 as a zero-fold 
knot with f} 2 = -1- 

It is even possible to have knots with negative multiplicity, since, in certain 
degenerate situations, two n-ic curves can meet with G k continuity for some k > n 
without being the same curve. Consideration of the Beta Constraints shows that 
this can happen only when the n-ic curves being joined are degenerate in the sense 
that they lie in a flat of dimension less than n, since the relations (BC n +\) through 



20.7 



GEOMETRIC CONTINUITY FOR SPLINE CURVES 



117 




Fig. 20.7. Two quadratic spline curves with G 2 joints 

{BC k ) will be linear dependencies among the derivatives. An extreme example is the 
two n-ic plane curves F(u) := <u,u n ) and G(u) := (u + u n ,u n ). These two curves 
meet with G 2n ~ 2 continuity at u = 0, but not with G 2 "" 1 continuity. To see this, 
one can examine the Beta Constraints to find that the values f} 2 = •* • = /? n -i = 0, 
Pn = n!, £ n+1 = ••• = p2n-2 = 0 are the unique choices that will make (BC Q ) 
through (BC 2n -2) hold, but that no value of #jn-i will make (5C 2n -i) hold. 
The corresponding reparameterization is h(u) := u + u n . This means that the 
parameter value 0 where the curve segments F(pT,0]) and G([0, 1]) meet is a knot 
of multiplicity 2 - n. Note that a knot of negative multiplicity, by enforcing a 
degeneracy, reduces the number of control points needed, just as a knot of positive 
multiplicity increases their number. We would need n + 1 control points to specify 
a single polynomial n-ic segment ff(FI, I]); but the two-segment spline formed out 
of F([-1,0]) and G([6, 1]) needs only (n + 1) + (2 - n) = 3 control points. 

For our third example of the mysteries of geometric continuity, consider the 
five arcs of parabolas shown in the pentagram on the left in Fig. 20.7. Each joint 
between one arc and the next is G 2 continuous, since it is surrounded by an A-frame 
with <t=1 and A = 2<f> (corresponding to /? 2 = ^-3), where <f> := (l+\/5)/2 denotes 
the golden ratio. Therefore, suppose that a designer asked us to draw a quadratic 
spline curve with the parameter interval [0,5], with 0 and 5 as triple knots, and 
with each point in the set {1,2,3,4} a zero-fold knot with fa = <f> - 2. Zero-fold 
knots don't affect the number of control points, so we would expect such a spline 
curve to need three control points, the same as a single parabolic segment. In fact, 
every such spline curve F will have, as trellis, an affine transform of the pentagram 
shown in Fig. 20.7; in particular, every such spline will satisfy F{§) = F{1). Thus, 
it does indeed take three control points to specify an instance of such a spline, but 
we couldn't use the starting point and the ending point as two of those three control 
points. The triangle on the right in Fig. 20.7 shows that the same phenomenon of 
enforced circularity can arise with only three segments, if we are willing to allow 
V-frames, that is, A < 0. Each joint between two parabolic arcs in that triangle is 
G with a — 1, A = —2, and fa = -3. The corresponding construction in a square 
would have four Il-frames, each with a — 1, A = oo, and /3 2 — -2. 



Part F: Spline Surfaces 



Spline surfaces are considerably more subtle than spline curves. Blossoming 
can give us a new perspective on the subtleties involved, but it doesn't magically 
make them go away. Because of those subtleties, we shall study spline surfaces with 
a rather different strategy. Instead of trying to be general, we shall focus on two of 
the simplest and most symmetric situations imaginable: patches that are squares 
and patches that are equilateral triangles. In Section 22, when we deal with square 
patches, we will end up succeeding in our quest for good labels. In Section 23, when 
we tackle triangular patches, we will be left with the tantalizing possibility that 
good labels of some as-yet-unknown structure might exist. Section 21 sets the stage 
for those two investigations. 

21. Joints between polynomial surfaces 

Before one can construct a spline function F:P — ► Q, one must partition the 
parameter space P into regions. In the case of spline curves, the parameter space 
P is the affine line L; since we presumably want our regions to be connected, the 
only choice is to partition L into intervals. But the parameter space of a spline 
surface is a plane, and there is a tremendous variety of ways to partition a plane 
into connected regions. Since we will want the surface patches that are defined on 
adjacent regions to join with at least C° continuity, the boundaries between adjacent 
regions can't be too chaotic. In particular, they had better be segments of algebraic 
plane curves. But that still leaves plenty of freedom of choice. 

In our first step away from generality and towards simplicity, we shall restrict 
ourselves to the case where the parameter plane P is partitioned into convex poly- 
gons. This means that the boundary between two adjacent regions in P is a line 
segment. 

We want the surface patches defined on adjacent regions in P to join with 
some flavor of continuity. One could choose to enforce either a parametric or a 
geometric continuity constraint. In our second step towards simplicity, we shall 
restrict ourselves to the parametric case. That is, if A and B are two adjacent regions 
in the partition of P, we shall demand that the polynomial surfaces Pa: A — ► Q and 
Fb'.B—>Q agree to ifeth order at all points u along the boundary between A and 
B y for some choice of A:, without any reparameterization. 

Taken together, these two simplifying assumptions have an interesting conse- 
quence, when viewed from a blossoming point of view. 

Proposition 21.1. Let A and B be two adjacent regions in the parameter plane 
P that meet along the segment [s,t] of the line L in P, where s ^ t. Two n-ic 
polynomial surfaces F A :A -* Q and F B ' B^Q join with C k continuity along 
the segment [s,t] if and only if their multiafRne blossoms f A and js agree on all 
Argument bags that include at least (n - A:) points anywhere along L. 

Proof. By Prop. 8.8, Fa and Fb will agree to kth order at some point v in [s,t] if 
and only if their blossoms satisfy /|(u a •••u fc v n - fc ) = /f (u a • • -u fc v n - fc ). If this 
identity holds for all v in the segment [s,t], it must also hold for v anywhere on the 
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entire line L, since /a and /b are given by polynomials. Furthermore, suppose that 
we fix Ui through for a moment. The resulting function 

v *-+ /{a,b}(ui, . . . ,u fc , v, . . . ,v) 

n-k 

is a polynomial curve of degree n — k defined on the line L, The blossom of this 
curve can be computed either by the rule 

{Vfc+i, . . . , V n } i-> /a(Ui, . . . ,11*, Vfc+i, . ..,V n ) 

or by the same rule with fs replacing /a- Since blossoms are unique, we conclude 
that /A(ui,...,u A; ,v fc+1 ,...,v n ) must equal /b(ui, . . . ,u*, Vjb+i, . . . , v n ), even if 
Vfc+i through v n are different points along L. □ 

In this situation, the line L that separates the regions A and B is called a 
knotline, and the number m := n — k is its multiplicity. We can restate Prop. 21.1 
in these terms as follows: two polynomial surfaces of degree n meet with C n ~ m 
continuity along a knotline L if and only if their blossoms agree on all argument 
bags that include at least m points along L. 

It is also worthwhile figuring out what the multiplicity of a knotline means in 
terms of the locations of the Bezier points of the joining surface patches. In the case 
of a C k joint between two curve segments, we found that the last k + 1 Bezier points 
of the entering segment F([f> s]) were joined to the first k + 1 Bezier points of the 
leaving segment ]) by a de Casteljau Diagram with A; shells. Joints between 

surface patches are analogous, but the analogy is rich enough that a few pictures 
are in order. In particular, it will turn out that C k continuity between two n-ic 
surface patches is guaranteed by n — k + 1 overlapping two-dimensional de Casteljau 
Diagrams, each with k shells. (A two-dimensional de Casteljau Diagram is one, like 
the one shown in Fig. 3.4, that is constructed using two-dimensional interpolations; 
it is each interpolation, not the diagram as a whole, that is two-dimensional.) 

Before we can compute Bezier points for the joining patches, we must choose 
a reference triangle for each of them. This choice was pretty obvious in the case 
of curves: the domain of each curve segment was an interval, and we used that 
same interval as the Bezier reference interval. If our surface patches are triangular, 
it is similarly natural to choose the domain triangle of each patch as the Bezier 
reference triangle for that patch. But domain regions that are polygons with more 
than three vertices don't present any such obvious choice. Fortunately, it isn't hard 
to convert from one reference triangle to another. Suppose that we are given the 
Bezier points of an n-ic surface F with respect to the reference triangle Arst in P\ 
that is, we are given the triangle of blossom values /® (rVt fc ) for t + j + k — n. If 
we apply the de Casteljau Algorithm for the point u in P, it produces for us the 
tetrahedron of blossom values /®(r t s J 't*u l ) for t + j + k + I = n. The four faces of 
this tetrahedron are the Bezier points of F with respect to each of the four reference 
triangles Arst, Arsu, Artu, and Astu. By applying the de Casteljau Algorithm 
three times, we could convert from the initial reference triangle Arst to an arbitrary 
reference triangle Auvw. 
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Fig. 21.2. The Bezier points of two cubic surfaces that join with C 1 continuity 



For the study of a knot line L, it is convenient to choose the reference triangles 
of the two joining surface patches so that they share an edge that lies along L. Let A 
be some region to the left of L in P and let B be some region to the right of L. 
Suppose that we know the Bezier points of the surface patch Fa with respect to 
the reference triangle Apst and those of F B with respect to Aqst, where s and t 
are two points on L. If L is a knotline of multiplicity m, Prop. 21.1 tells us that 
ff{p i q^s k t l ) = fs (P*q , »** 1 ) for it + / > m. It is these points, on which the two 
blossom agree, that form the constraint diagrams surrounding the joint. 

For C° continuity, when m = n, the only blossom agreement is given by 
/®(s*t n -*) = /f (s fc t n ~*); that is, the Bezier points that determine the edge curve 
Fa{L) must agree with the Bezier points of Fb(L). 

For C 1 continuity, we can replace one of the copies of either s or t in the ar- 
gument bag by either p or q without destroying agreement between the blossoms. 
Fig. 21.2 depicts the case n = 3. Note that the joint curve is surrounded by n affine 
images of the quadrilateral [p,s,q,t] in P formed by the two reference triangles. 
These quadrilaterals can be thought of in three ways: they either extrapolate right- 
ward from the rightmost two rows of Bezier points of Fa (Apst) to compute the 
leftmost two rows of Bezier points of ^(Aqst); or, vice versa, they extrapolate 
from right to left; or, more symmetrically, they constrain the two rows of Bezier 
points of both Fa and Fb that are nearest the knotline. 

Raising the continuity to C 2 constrains the third row of Bezier points as well. 
If n = 3, there are two points in each third row, constrained as shown in Fig. 21.3. 
Note that this diagram includes two points, /{a.b^PjQ* 8 ) anc * f{A t B}{Pi<l> *)> '^ a * 
are not Bezier points of either surface patch. 
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f{A,B}(p>q,t) 



/{A,B}(p,q,s) 



/{A,B}(p,p,t) 



/{A,B}(q, q, t) 



Fig. 21.3. The Bezier points of two cubic surfaces that join with C 2 continuity 

For cubics, going all the way to C 3 continuity forces F A and F B to be identical; 
see Fig. 21.4. The extreme Blzier point /a(p,P,p) of F A (Apst) is joined to the 
extreme BSzier point /a(q,q,q) of ^(Aqst) by a chain of three quadrilaterals, 
each an affine image of [p,s,q,t]. The interior vertices along this chain are the 
blossom values /{A,B}(p,P,q) and /{a,£} (P>q>q)- Fig. 21.4 is a two-dimensional 
de Casteljau Diagram with three shells. It looks rather different from the analogous 
diagram with two shells back in Fig. 3.4, but the difference is only skin deep. In 
Fig. 3.4, the newly added point u lies inside the original reference triangle Arst, 
so we are doing two-dimensional interpolation. In Fig. 21.4, say viewed from left 
to right, the new point q lies outside the original triangle Apst, so we are doing 
two-dimensional extrapolation. In the one-dimensional case, interpolation and ex- 
trapolation look pretty much the same, since they both involve three points along a 
line. In the two-dimensional case, the de Casteljau Algorithm is controlled by four 
points in a plane, and quite different geometries are possible. 

Returning to the C 2 case in Fig. 21.3, we can interpret that figure as two 
overlapping instances of extrapolating de Casteljau Diagrams with two shells, one 
consisting of all points with at least one s in their labels, the other consisting of all 
points with at least one t. In general, as claimed above, C k continuity between n-ic 
surface patches is enforced by n — k + 1 overlapping two-dimensional de Casteljau 
Diagrams, each with A; shells. 

Now that we understand what a C k joint between two surface patches looks like, 
the next step in building a spline surface is to choose a partition of the parameter 
plane P into convex polygonal regions and to select multiplicities for the various 
knotlines where those regions abut. Before starting that, we shall close this section 
with some general remarks about overloading the notation for the blossom of a spline 
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Fig. 21.4. The Bezier points of two cubic surfaces that join with C s continuity 
surface. 

So far, we have always indicated by means of an explicit subscript which region's 
blossom we meant to apply to a particular bag of argument points {u i , . . . , u n } . The 
expression /a(ui, . . . ,u n ) tells us to apply the blossom f A of the patch F Ay while 
the expression /{a,B}(ui, . . . ,u n ) tells us to apply either f A or f B and asserts that 
the two results will be the same. Just as in the case of spline curves, an overloading 
rule is a convention for determining the correct subscript from the locations of the 
points u*. Let Hull(ui, . . . ,u n ) denote the closed convex hull of the points u», and 
let A denote the closure of the region A. Generalizing the case of curves, the tame 
overloading convention considers a region A to be a candidate for defining the value 
/(ui, . . . ,u n ) if and only if 

AnHull(ui,...,u n ) ^ 0. 

The term /(ui, . . . ,u n ) is then well-defined under the tame overloading convention 
if and only if all of the values f A {u ly . . . ,u n ) for all candidate regions A agree. The 
natural generalization of wild overloading to the case of surfaces considers a region 
A to be a candidate if and only if 

dim (An Hull(ui,. . . ,u n )) = dim(Hull(u x , . . . ,u n )). 

Deciding whether or not an overloaded term is well-defined can be a rather sub- 
tle question for surfaces. For example, consider the situation in Fig. 21.5. Suppose 
that we want to assemble a quadratic spline surface out of three patches, defined on 
the three triangular regions A, B, and C We require F A and F B to meet with C l 
continuity along the line qs, and we require Fb and Fc to meet with C 1 continuity 
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Fig. 21.5. A subtle geometry in the domain plane of a spline surface 

along the line qt. But we don't place any continuity constraint on the joint between 
F A and Fc along qr. 

We should pause to verify that these constraints are actually honest. Note 
that, if we had demanded C 2 continuity between A and B and also C 2 continuity 
between B and C, then C 2 continuity between A and C would have been automatic; 
for quadratics, C 2 continuity means equality, and equality is transitive. Thus, one 
might worry that C 1 continuity across the (A, B) joint and also across the (B,C) 
joint might imply something about the (A,C) joint. Here is an example to show 
that the (A,C) joint can fail to be even C°. Let x, y, and z be three distinct 
points in the object space Q. Define /aOm:) := x, /c(r,r) := z, and let the other 
five Bezier points of ^(Aqrs) and F c (Aqtr), as well as all six Bezier points of 
F B (Aqst), be y. The (A, B) joint in the resulting spline surface is C 1 continuous 
because C l continuity only depends on the two rows of Bezier points nearest the 
joint line and all those Bezier points have the common value y. Similarly, the 
(B,C) joint is C 1 continuous. But the (A,C) joint is not even C° continuous, since 
*U(r) = x * z = F c (r). 

Consider the overloaded term /(u,v); is it well-defined? Since Hull(u,v) is the 
line segment joining u to v, the two candidate regions are A and C, under either 
tame or wild overloading. At first glance, it seems that there is no reason why the 
equality /a(u,v) = /cr(u,v) should hold, since there is no continuity constraint at 
all on the joint between A and C. But note that the point v happens to lie on 
the line that separates A from 5. Since that line is a knotline of multiplicity one 
and the argument bag {u,v} includes one point along that line, we deduce from 
Prop. 21.1 that /a(u,v) = /b(u,v). Symmetrically, since u lies on the simple 
knotline that separates B from C, we deduce that /b(u,v) = /c(u,v). Therefore, 
/a(u,v) = /c(u,v); the overloaded term /(u,v) is well-defined, but for a rather 
subtle reason. 

22. Spline surfaces with rectangular patches 

The two most symmetric ways to partition the parameter plane P into convex 
polygons are to use congruent squares or congruent equilateral triangles. In this 
section, we shall consider the case of squares. 

Technically, the parameter plane P is just an affine space, so the concept of 
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a square isn't defined. What we shall really do is to divide P into a lattice of 
congruent parallelograms. Having done so, it doesn't hurt anything to choose two 
adjacent sides of one of those parallelograms as orthonormal basis vectors, thus 
giving P the structure of a Euclidean plane divided into squares. Doing so has the 
advantage of making the symmetries more apparent. 

We hope to associate a polynomial surface of degree N with each square in 
such a way that the resulting patches fit together smoothly. To keep things as 
symmetric as possible, we choose to enforce the same level of continuity across each 
joint between two adjacent squares. What level of continuity should we demand? In 
the case of curves, we can ask for C n ~ x continuity between two segments of degree 
n and still leave sufficient flexibility. But for surfaces, it turns out that we can't do 
nearly that well; we can only get C k continuity for k some fraction of N. 

To see why, let us first clarify our goals somewhat. The higher the level of 
continuity that we demand, the fewer spline surfaces there will be. In computer- 
aided geometric design, it is important that we stop while the splines still have 
local flexibility. That is, we want to be able to change a small region of a design 
without affecting the bulk of the surface. The spline surface techniques that are 
most important in practice are those that achieve, for a given degree, the highest 
possible continuity that still allows local flexibility. 

Note that local flexibility is not as stringent a demand as local control. A 
method of drawing splines has local control if each control point of the method 
affects only a small region of the design. For example, consider C 2 cubic spline 
curves with uniform knot spacing, say knots {U} at the integers U := t. One method 
of controlling such curves involves specifying their de Boor p oints. This method has 
local control: moving the de Boor point /(jrj.s+S) (« - + 1 ) affects onl y the 
four adjacent segments with domains (t - 2, t - l) through (t + 1, t + 2). Another 
method of controlling the same curves involves specifying the locations of the joints. 
This method does not have local control: moving one joint affects the entire spline 
curve, although the effect dies off exponentially. Fig. 22.1 shows one example of 
this. The straight line on top is the cubic spline that interpolates the joints (t,0), 
while the spline curve in the middle is what results if we lift the joint at (0,0) up to 
(0,1). (The de Boor points of the middle spline are at <t,\/3(\/3 - 2) '*').) Despite 
their lack of local control, interpolating cubic splines do have local flexibility. If we 
want to make a local modification, we just have to remember, when we perturb a 
joint by some vector £ in Qq } that we must also perturb its two neighbors by £/4. 
The bottom spline curve in Fig. 22.1 shows what happens if we also lift the joints 
at (±1,0) up to (±1, \). (The de Boor points of this spline are at (t,3&o/ 2 )*) In 
fact, local control and local flexibility are really properties of different things. Local 
flexibility is a property of the spline space itself, while local control is a property of 
a scheme for choosing one spline out of the space. 

How high a level Jfe of C k continuity can we demand at the joints between square 
patches of degree N if we want to preserve local flexibility in the resulting surface? 
To ask the same question a different way, how high a degree N is needed to allow 
C k splines with square iV-ic patches to be locally flexible? We shall first compute 
a lower bound on N> and then show that that lower bound is actually the correct 
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Fig. 22.1. Two different ways of perturbing a cubic spline curve 

answer. 

Proposition 22.2. Spline surfaces built by asaembling square patches of degree N 
with C k continuity at the joints can't possibly have local flexibility ifN < 2k + 1. 
In the borderline case where N = 2k + 2, tlie resulting spline surfaces have at most 
one control point's worth of freedom per internal patch. 

Proof Let F: P — ► Q be a spline surface with square patches of degree N that join 
with C k continuity. We first want to show that N < 2k+l precludes local flexibility. 

Imagine determining the square patches of F in raster-scan order, say from left 
to right and from bottom to top. In this scan, suppose that we have just arrived 
at the new patch F A > whose domain is the square region A shown in Fig. 22.3. We 
can assume that the square patches F B and F D associated with the regions B and 
D have already been determined. The demand for C k continuity between A and its 
neighbors B and D constrains our choice of Fa* We shall show that, if N < 2k + 1, 
then Fa is completely determined by Fb and Fr>. Since every square patch in the 
interior of the spline's domain is just like F A , this is enough to prove that C k spline 
surfaces for N < 2k + 1 can't have local flexibility. 

Suppose, then, that N < 2k + 1; we want to show that F A is completely 
determined by F B and F D . Consider the Bezier points of F A with respect to the 
reference triangle Axyz, which are the blossom values /JfCx'y* - *""'^) for i + j< 
N . Since F A joins Fb with C k continuity along the knotline connecting x and y, 
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Fig. 22.3. Determining the next square patch of a spline surface 

we deduce from Prop. 21.1 that ff^ i y N ~ i - j z^) = /I(x*y N "'- 5 ^) for j < k. 
Similarly, we deduce that /®(x < y* r "'~'* i ) = /|(x*y A r — 'z') for t < k. But we 
have i + j < N < 2k + 1; hence, every Bezier point of F A must satisfy either t < k 
or j < Jfc. Therefore, every Bezier point of F A is determined. 

Note, in fact, that some Bezier points of F A are determined both by F B and by 
F D . If F B and F D disagree about any such Bezier point, there won't be any legal 
choice of F A . For example, this will certainly be the case if F B and F D don't agree 
with each other to fcth order at the point y. But the possibility that F A might be 
over-constrained doesn't affect the validity of our argument, because we are giving 
only an upper bound on the amount of freedom in F A . 

It remains to bound the amount of choice we have in determining F A in the 
borderline case when N = 2k+2. In this case, there will be precisely one Bezier point 
of F A that is not determined: ff (x fc+1 z* +1 ). Thus, a square-patch spline surface 
of degree 2k + 2 with C k continuity has at most one point's worth of freedom per 
interior patch. □ 

The phrase "one point's worth of freedom" is rather clumsy. In the future, we 
shall say simply "one degree of freedom," with the understanding that each degree 
of freedom means the free choice of a point in the object space Q. 

There is a trickier way to prove Prop. 22.2, due to C. de Boor and R. DeVore 
[12], which we might call corner-sectioning. We shall discuss corner-sectioning both 
because of its intrinsic interest and because we will want to use it in earnest when 
discussing triangular-patch splines in Section 23. 

Draw a line L that cuts off a corner of A, and let f 0 through f 3 be the points 
where L crosses the boundaries of the squares B, A, and D, as shown in Fig. 22.3. 
If we restrict the spline surface F to the line L, we will get an iV-ic spline curve 
GiL-tQ,* section of the surface F. Furthermore, if m := N - k is the multiplicity 
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of each knotline, the knot sequence of G is given by 

(..., fo,.y ,rq , fi>- • **u fa»«y >Fa > fs» - ■ >r 3 ) . . . ) . 

TTl fTl f71 TFl 

A de Boor point of G has the form ^(f tl ^ +Ar+1 )(?»+i) • • - >U+n) where is an 
indexing of this knot sequence. Imagine choosing a substring of length N from the 
knot sequence of G. If that substring includes the first occurrence of fi, then the 
validity interval of the corresponding de Boor point will include the interval (f 0 , fi). 
Such a de Boor point can be computed from the behavior of G over the interval 
(^0,^1), and is hence determined by Fb- Similarly, if the substring of length TV 
includes the last occurrence of , the validity interval of the corresponding de Boor 
point will include (r2,fs) and hence that de Boor point will be determined by Fj). 
If N < 2k + 1, and hence N > 2m — 1, there isn't room to fit a substring of N 
consecutive knots of G in between the first f i and the last f 2 , so every de Boor point 
of G that influences the segment G([f2,r3]) will be determined, either by Fb or by 
F&. Hence, the segment G([f2,F3]) will be determined. Since this same argument 
applies to any line L that cuts off a corner of A, we deduce that, when N < 2k + 1, 
the values of F A throughout the triangle Axyz are determined by Fb and F D . Since 
Fa is given by polynomials, this implies that all values of Fa are determined. 

The same corner-sectioning idea can also be used to show that Fa has only one 
degree of freedom in the borderline case when N — 2k + 2, and hence N = 2m — 2. 
In this case, there is exactly one substring of N knots of the spline curve G that fits 
between the first Fi and the last F2. This means that the section G has precisely 
one de Boor point that is free to vary. Choose a point q in the interior of Axyz, 
and consider the various corner-sectioning lines L that pass through q, which form 
a double wedge with vertex q. The value ^U(q) in Q is enough to determine the 
one free de Boor point of the section G associated with each line L. Thus, the value 
Fa(<i) determines all of the values of Fa in the double wedge, which is enough to 
determine Fa completely. 

As part of our quest for simplicity, we shall focus in this paper on the borderline 
case in Prop. 22.2, in which the degree N = 2k + 2 is just high enough to avoid 
precluding local flexibility. For notational convenience, let n be a nonnegative integer 
and set N := 2n and k ~ n — 1. The following proposition shows that locally flexible 
square-patch spline surfaces with those borderline parameters really do exist, thus 
demonstrating that the bound in Prop. 22.2 is tight. 

Proposition 22.4* There is a method for drawing square-patch spline surfaces of 
degree N = 2n with C n ~ x continuity that involves specifying one control point per 
interior patch. (If n is odd, it is more natural to think of the control points as 
associated with vertices rather than with patches.) In this method, each control 
point influences (n + l) 2 patches, which means that each patch is influenced by 
(n + l) 2 control points. 

Proof sketch. We shall describe this method by indicating how its blending functions 
can be defined, where a blending function is the real-valued spline function that gives 
the influence of a particular control point on the location of the eventual surface. 
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In the base case n = 0, we want to put together square const ant patches with 
C _1 continuity, that is, with no continuity at all. It is easy to see how to do this 
with one control point per patch: each control point specifies the value of the entire 
associated patch. In particular, each blending function is identically equal to 1 on 
one square and is 0 everywhere else; that is, each blending function is a translate of 
the function 

*„(») = I 1 »"(M1*M 

1 0 otherwise. 

Note that all of the translates of B 0 (u) together form a partition of unity (if we 
don't worry about what happens on the boundaries of the squares). 

For larger values of n, we can produce blending functions by convolving -B 0 (u) 
with itself some number of times. If G and H are real-valued functions defined on 
P, the convolution G * H is given by 



(G * H)(u) := J J F(v)G(u - v) dv. 



Let B n (u) := (5 n -i * 5 o)(u) for n > 1. For example, £i(u) contains four nonzero 
patches, each of which is a hyperbolic paraboloid: 



B^u) = B x ({u y v)) = < 



uv if u G [0, 1] x [0, 1] 

(2-u)v if US [1,2] X [0,1] 

u(2» v) if uG [0,1] x [1,2] 

(2-u)(2-v) if uG [1,2] x [1,2] 

0 otherwise. 



It is easy to see that B n (u) will be nonzero precisely on the (n+1) 2 unit squares that 
form the square [0, n] x [0, n]. It is also true, though we won't verify it here, that each 
convolution of Bq(u) with itself raises the degree by 2 and the level of continuity 
by 1. Thus, the blending function B n (xi) is a real- valued spline function with square 
patches of degree 2n and with C n " x continuity. For even n, the maximum of B n (u) 
is located at the center of a patch; for odd n, the maximum occurs at a vertex where 
four patches join. 

The convolution process preserves the property that the various translates of 
S n (u) form a partition of unity. Hence, we can use those translates as blending 
functions. We arrive at a method for controlling C n-1 spline surfaces of degree 2n 
with one control point per patch. In this method, each patch of the resulting surface 
is influenced by (n + I) 2 control points. □ 

A general polynomial surface of degree 2n has ( 2n 2 +2 ) = (n + l)(2n+ 1) Bezier 
points with respect to any reference triangle. The surface patches that arise in the 
spline surfaces of Prop. 22.4 are influenced by only (n + l) 2 control points. Hence, 
there must be n(n + 1) constraints of some sort that hold for those surfaces. It 
turns out that we can think of these constraints as constraints on their high-order 
derivatives, as shown in the following lemmas. 

If F: P — ► Q is a polynomial surface and £ is a vector on P, we can construct 
a new surface whose values are vectors on Q by associating with each point u in P 
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the derivative dF u (£) = D^F(u) of F in the £ direction at u. Since we are holding 
the direction £ fixed while varying u, the notation of directional derivatives is more 
natural here; we shall use the symbol D$F to denote this derived surface. 

Lemma 22.5. If F:P —> Q and G: P — ► Q are two polynomial surfaces that meet 
with C° continuity along a line L in P, and if £ is a vector on P parallel to L, then 
the derived surfaces D^F and D^G also meet with C° continuity along L. 

Proof If u is a point on L, we can compute D(F(u) by evaluating F at u and at 
points near u along L. Since F and G take on the same values everywhere along L, 
we must get the same answers when we compute D^F(n) and D^G(u); hence, D^F 
and DfG also agree along L. □ 

Lemma 22.6. In the situation of Lemma 22.5, if F and G meet with C k continuity 
along L, then the derived surfaces D^F and D^G meet with C k continuity along L 
also. 

Proof Let rji through 77* be arbitrary vectors on P. If the surfaces F and G 
meet with C k continuity along L, then the derived surfaces D fll *" D Vk F and 
D ni <" D nk G must meet with C° continuity along L. If we differentiate one more 
time, in the £ direction, Lemma 22.5 tells us that the resulting derived surfaces 
D{D ni '"Drf k F and D{D rtl •••Z) fJfc G will also meet with C° continuity along L. 
Since the various directional derivatives commute with each other, this implies that 
D ni • • • D nk D{F and D ni • • • D nk D^G meet with C° continuity. Since tfi through 
rik were arbitrary, we conclude that D^F and D^G must meet with C k continuity 
along L. □ 

Lemma 22.7. If F:P — > Q and G:P — ► Q are two polynomial surfaces of degree 
2n that meet with C n ~ l continuity along a line L, and if £ is a vector on P parallel 
to L, then the derived surfaces (D{) n+1 F and (D^) n+1 G must be identical, that is, 
they must agree everywhere. 

Proof. If we apply Lemma 22.6 repeatedly, a total of n + 1 times, we can deduce 
that the derived surfaces (D^) n+1 F and (Z?^) n+1 G must meet with C n_1 continuity 
along L. But these derived surfaces are polynomial surfaces of degree at most n — 1; 
if they agree at any point up through order n — 1, they must agree everywhere, that 
is, they must be identical. □ 

Suppose, then, that we build a spline surface F with C n_1 continuity out of 
square patches Fa of degree 2n. Lemma 22.7 will apply to any two adjacent surface 
patches. In particular, let f be a horizontal vector on P and let r\ be a vertical 
vector. All of the surface patches Fa for A in a single horizontal stripe must share 
the same value of (Z^)" 4 " 1 ^, while all of the patches Fa for A in any vertical 
stripe must share the same value of (D^) n+l FA* Thus, these high-order derivatives 
of our surface patches are not locally flexible. We have to make a global choice for 
each horizontal stripe of a single polynomial function of degree n — 1 that will give 
(Z? r? ) n " f x Fa for all A in that stripe, and similarly for vertical stripes. 

How shall we choose, for each stripe, what the derived surfaces shall be along 
that stripe? Since we can't vary this choice locally, the simplest course of ac- 
tion, which we shall adopt, is not to vary it at all. In particular, we shall restrict 
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ourselves to square surface patches F A for which (Z)<) n+1 FA and (£>„) n+1 FA are 
both identically zero. In defense of this choice, we claim that the square patches 
Fa used by the spline-drawing method of Prop. 22.4 do satisfy the constraints 
{Dt) n+l F A = (D n ) n + l F A = 0. To see why, note that it is enough to show that the 
blending function B n (u) satisfies those constraints. And B n (u) must satisfy them 
because it has compact support: if we move out along any stripe far enough, we 
get to patches in B n (u) that are identically zero, and hence certainly have their 
derivatives of all orders equal to zero. 

This explains why each internal patch F A in one of the spline surfaces of 
Prop. 22.4 has only (n+1) 2 degrees of freedom in it, instead of the (2n+l)(n+l) that 
we would naively expect. The derived surface (D^) n+1 F A> being a surface of degree 
n - 1, has degrees of freedom in it, and the constraint {Dt) n+1 F A = 0 elim- 

inates that freedom. Considering the two directions of stripes, we have accounted 
for all 2( n + 1 ) = n(n + 1) missing degrees of freedom. 

Our decision to adopt the constraints (£>{) n+1 F A = (A,) n+1 *A = 0 has a very 
important consequence: it means that we are actually building spline surfaces out of 
bipolynomial patches of degree (n; n). In particular, let F A be a polynomial surface 
of degree 2n that satisfies the constraints (D e ) n+1 *U = (£>r,) n+1 *U = 0. If we view 
the parameter plane P of Fa as the product U x V of two afiine lines, one in the 
£ direction and the other in the r\ direction, the resulting function Fa' U x V — ► Q 
will be bipolynomial of degree (n; n). 

With this insight, we can generalize our results somewhat and also obviate a lot 
of the earlier work in this section by defining bipolynomial spline surfaces as follows. 
Let m and n be nonnegative integers. Let and {tj} be two knot sequences, 
located in the affine lines U and V respectively. We can form a bipolynomial spline 
surface F: U X V — ► Q of degree (m;n) by assigning arbitrary values to the de Boor 
points 

Note that these de Boor points have validity rectangles rather than validity intervals. 
The patches of the resulting spline surface have domain rectangles of the form Ry := 
($*,*Jb+i) x (tj,t'i+i), where k and / are chosen so that s k < k k +i and U < *t+i. 
The patch defined on the rectangle R k i is determined by the (m+ l)(n+ 1) de Boor 
points {xij} for t in [Jfc - m,Jb] and j in [/ - n,/]. Two patches that are adjacent in 
the U direction join with C m ~ r continuity, where r is the multiplicity of the knot 
Si that divides them. Similarly, two patches adjacent in the V direction join with 
C n ~ r continuity, where r is the multiplicity of the dividing knot tj. 

The availability of the concept of a bipolynomial surface makes square-patch 
spline surfaces much easier to understand than they otherwise would be. Let us 
recapitulate the developments in this section. If we assemble square patches of 
polynomial surfaces together with continuity constraints on the joints, we found 
that we perforce end up with global constraints relating the high-order derivatives 
of various patches in the directions parallel to the edges of the squares. Since we 
weren't interested in modifying the surface globally, we chose to satisfy these global 
constraints trivially, by demanding that the relevant derivatives be zero in every 
patch. The critical fact is that precisely those derivative constraints are built into 
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Fig. 22w8. The domain A of one square patch of a spline surface 

the very concept of a bipolynomial surface. Thus, a patch Fa of a square-patch 
spline surface is not a general polynomial surface of degree 2n; but it is a general 
bipolynomial surface of degree (n; n). 

To be explicit, consider the case n = 1, in which each patch Fa is a hyperbolic 
paraboloid. We can view a hyperbolic paraboloid as an instance of a polynomial 
surface of degree two, and blossom it in that way; let g:P 2 —> Q be the resulting 
2-blossom. The 2-blossom g of Fa gives us six Bezier points with respect to any 
reference triangle, but they have to satisfy certain constraints. For definiteness, 
let us choose the reference triangle Axyz, as shown in Fig. 22.8. The constraint 
D^D^Fa = 0 corresponds to the condition flf(y,y) - 2</(y, z) + g(z,z) = 0. Similarly, 
the constraint D^D^Fa = 0 correponds to the condition j(y,y)-2j(x,y)+j(x,x) = 
0. The bipolynomial point of view neatly eliminates the redundancies and con- 
straints. The (1; l)-blossom / of Fa gives Fa the four unconstrained Bezier points 
/(«o;v 0 ), /(uoJ^i), /(uijvo), and /(ui;t$i). Note, in fact, that the constrained 
2-blossom g is precisely the result of applying degree-joining to the unconstrained 
(l;l)-blossom /. Working with g instead of / makes things more complicated be- 
cause the complex degree-adjusting formulas of Part D become relevant. 

23. Spline surfaces with triangular patches 

In this section, we shall study what happens if we attempt to build a spline 
surface based on a partition of the parameter plane P into equilateral triangles. On 
the positive side, we will find that we can get a higher level of continuity for the 
same degree of surface. On the negative side, after we have figured out what the 
resulting derivative constraints are, we won't find an analog of the bipolynomial the- 
ory lying around, waiting to come to our rescue by having precisely those derivative 
constraints built in. 

Let N be the degree of the triangular surface patches. To keep things sym- 
metric, we shall enforce the same C k continuity constraint on all the joints between 
pairs of adjacent patches. Just as in the case of square patches, our first result 
will warn us that setting the degree too low for a fixed continuity precludes local 
flexibility. By the way, note that a triangular grid has twice as many regions as 
vertices: each vertex is both the highest point in some upward-pointing triangle and 
also the lowest point in some downward-pointing triangle. 

Proposition 23.1. Spline surfaces built by assembling triangular patches ofsur- 
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Fig. 23.2. Determining the next two triangular patches of a spline surface 

faces of degree N with C k continuity across the joints can't possibly have local 
Sexibility if2N<3k+l. In the borderline case where 2N = 3k + 2, the resulting 
spline surfaces have at most one degree of freedom per internal vertex, that is, at 
most one control point's worth of freedom for every two internal patches. 

Proof Let F: P — ► Q be a spline surface with triangular patches of degree N that 
join with C k continuity. Imagine determining the patches of F in pairs, where each 
pair consists of an upward-pointing triangle and its left-hand downward-pointing 
neighbor, and where we take the pairs in left- to-right, bottom-to-top raster-scan 
order. In this scan, suppose that we have arrived at the pair of patches Fa and 
F B , shown in Fig. 23.2. We may assume that the patches F c and F D have already 
been determined. If F A and F B are completely determined by Fc> Fd> and the 
continuity constraints across the three joints (D, A), (A, B), and (B,C), then there 
is no chance for local flexibility. 

We can analyze whether or not Fa and Fb are determined in two different ways: 
either by reasoning about Bezier points or by using the corner-sectioning technique 
of de Boor and DeVore [12]. We shall use corner-sectioning, since it is easier. 

Draw a line L that cuts off a corner of both A and B, and let f 0 through r 4 be 
the points where L crosses knotlines as shown in Fig. 23.2. Restricting the surface 
F to the line L gives an N-\c spline curve G, whose knot sequence has the form 

(. ..,f 0 ,...,ro,fi,...,ri,f2,...,r2,f3,...,f 3 ,r4,... ,f 4 ,...), 

^— ^ ' * sy ' N s/ " V s/ ' V v ' 

m m m m m 

where m := N - k is the multiplicity of each knotline. The inequality 2N < Zk + 1 
is equivalent to N > 3m - 1. If this holds, a substring of N knots that is far enough 
to the right to avoid the leftmost copy of r x will have to include the rightmost copy 
of f 3 . Hence, every de Boor point of G that influences either of the two segments 
G ([ f i> f 2]) or G([r 2 ,r 3 ]) will be determined, either by F c or by F D . Wiggling the 
line L around a bit, we conclude that all of F A and Fb will be determined. 

In the borderline case where 2N = 3Jfc+2, we have N = 3m-2. Thus, the spline 
curve G will have precisely one free de Boor point. By rotating the line L about a 
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fixed point q in Apsq, as in the discussion following Prop. 22.2, we conclude that 
the patches and Fb together have at most one degree of freedom in them. □ 

Exercise 23.3. Reprove Prop. 23.1 by explicit reasoning about the Bezier points 
of Fa. and Fb- This is harder than one might expect because there is no single 
reference triangle that has two vertices on each of the three relevant knotlines. One 
way to deal with this complication is to consider the entire tetrahedra of blossom 
values /f (s»tVq m ) and /f (fl f tVq m ) for t + j + I + m = N, the faces of which 
are triangular arrays of Bezier points. Note that 

/®(s^4Vq m ) + />V + V q") = /•(sVp l+1 q«) + /jf (sVp , q- +1 ) 

whenever i + j + l + m = N -1, since s + 1 = p + q; and the same holds for f B . The 
constraint of C k continuity between F A and F D means that the values f$ (a*Vp'q TO ) 
are determined forj+/ < k. Similarly, C k continuity between Fb and Fc determines 
the values /f (s'tVq 171 ) for j + m < k. Finally, C k continuity between F A and F B 
implies the equality /£(sVp l q m ) = f§ (sVp'q™) for / + m < k. If 2N < 3k + 1, 
show that these conditions are enough to determine both tetrahedra entirely. If 
2N = 3A:+2, show that there is precisely one degree of freedom in the two tetrahedra. 

The bound in Prop. 23.1 suggests that using triangular patches will allow us 
to have smoother surfaces for the same degree, since k is roughly 2iV/3 instead of 
roughly N/2. Continuing our policy of pursuing simplicity, we shall focus on the 
borderline case where N := 3n + 1 and k :— 2n for some nonnegative integer n. 
The following proposition states that triangular-patch spline surfaces with these 
borderline parameters do exist. The convolutions used in the proof are frequently 
credited to M. A. Sabin with a 1977 reference [36], but they were also known, in 
1971, to P. O. Frederickson [24]. 

Proposition 23.4. There is a method for drawing triangular-patch spline surfaces 
of degree N = 3n+ 1 with C 2n continuity that involves specifying one control point 
per interior vertex. In this method, each patch is influenced by 3(n + l) 2 control 
points, while each control point influences 3(n + l) 2 upward-pointing patches and 
also 3(n + l) 2 downward-pointing ones. 

Proof sketch. In the base case n = 0, we want to assemble affine patches with 
C° continuity. It is easy to see how to do this with one control point per vertex: 
define each patch by performing a two-dimensional linear interpolation between the 
control points associated with its three corners. Let Bq(vl) be the corresponding 
blending function, which gives the influence of a specified control point. The graph 
of Bq(xl) is a pyramid of height 1 with a regular hexagon as its base. Note that, if 
we translate one copy of this pyramid to each vertex in P, the resulting functions 
form a partition of unity. 

For larger values of n, we construct a blending function B n (u) by convolving 
■Bo(u) with itself n times. It is convenient to choose the distance scale in the plane P 
so that the volume of the hexagonal pyramid is one, that is, so that / f p Bo(u) du = 
1, which corresponds to making each equilateral triangle have area 1/2 and hence 
side-length 8, where s 4 = 4/3. If we do this, each blending function B n (u) defined by 
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Fig. 23.5. A stripe in the domain plane of a triangular-patch spline surface 

B n (u) := {B n -i * Bo)(u) will have the property that its translates form a partition 
of unity. It is easy to see that the function B n (\i) will be nonzero precisely for u in 
a regular hexagon of side-length (n + which consists of 6(n + l) 2 triangles, half 
pointing up and the other half down. The harder part, which we omit, is to show 
that each convolution with J3 0 (u) raises the degree of the blending function by 3 
and raises the level of continuity of its joints by 2. After n such convolutions, the 
function B n (u) has degree 3n + 1 and C 2w continuity. Spline surfaces built using 
the translates of B n (u) as the blending functions inherit these same properties. □ 

A general polynomial surface of degree 3n+l has ( 3n 2 +3 ) Bezier points, while the 
surface patches that arise in the splines of Prop. 23.4 are controlled by only 3(n+ 1) 
points. This means that the latter surface patches must satisfy ( 3n 2 +3 ) ~ 3(n+ 1) 2 = 
3 ( n £ X ) constraints. Just as in the square-patch case, those constraints turn out to 
involve the high-order derivatives of the surface. 

Let F A and F B be two adjacent surface patches of a triangular-patch spline 
surface, and let £ and r\ be vectors parallel to the edges of the triangles as shown 
in Fig. 23.5. Consider the derived surfaces (D n ) n+1 F A and (A,) n+1 F B . Since 
F A and F B are of degree 3n + 1 and meet with C 2n continuity, these derived sur- 
faces are of degree 2n and meet with C n-1 continuity. Therefore, we can apply 
Lemma 22.7 to deduce that the doubly derived surfaces (£>e) n+1 (^r7) n+1 *U and 
{Dz) n + l {Dr,) n+1 F B must be identical. Comparing F B with F c , where C is the 
next region along the stripe, as shown in Fig. 23.5, we can apply the same argu- 
ment with the roles of £ and tf reversed to show that the doubly derived surfaces 
(D^) n+1 (i) f7 ) n+1 J ?, B and (Dt) n + l (D n ) n + l F c are also identical. We conclude that 
all of the triangular patches Fa in any horizontal stripe of the spline surface must 
share a single common value of the (n - l)-ic function (£>e) n+1 (I>r 7 ) n+1 *A. That 
is, these high-order derivatives are not locally flexible. 

The same arguments apply to the stripes in all three directions. Let a, /?, and 
7 be vectors on P that are parallel to the three axes of the triangular grid; it is 
convenient to arrange that a + /? + 7 = 0. For any triangular-patch surface F , the 
derived surface (D a ) n + l (Dp) n + l F A is constant for triangles A in a stripe parallel 
to the nr axis; the derived surface (D a ) n+1 {D^) n+l F A is constant along stripes; 
and (Dp) n + l (Di) n+1 FA is constant along a stripes. 

Just as in the square-patch case, we shall respond to the lack of local flexibility of 
high-order derivatives by setting those derivatives to zero, once and for all. From now 
on, when building triangular-patch splines, we restrict ourselves to patches Fa that 
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satisfy (Z?«) w+1 (D,) w+1 fU = {D*)** 1 ^)"* 1 F A = (2J / f) n+1 (^) n+l F A = 0. 
Note that the patches that are generated by the method of Prop. 23.4 must satisfy 
these constraints, because the blending function B n (u) used to build them includes 
patches that are identically zero, far enough out in every stripe. 

Each function (£> e ) n+1 (D f? ) n+1 F A , where £ and r) are two of {a,£,7>, has 
degree n — 1, so setting it to zero gives up ( n ^ X ) degrees of freedom. If we can show 
that the constraints arising from the three different choices of {£, r)} are independent, 
we will have accounted for all 3( n ^" 1 ) degrees of freedom that are missing in a patch 
drawn with the method of Prop. 23.4. To verify independence, it is easiest to think 
about the tensor space P®^ 3n+1 ^. A constraint on the high-order derivatives of a 
polynomial surface F: P — ► Q of degree 3n + 1 corresponds to a linear subspace 
of p®( 3n+1 ) on which the linear blossom ff is required to be zero. Our three 
constraints correspond to the three subspaces 

{a n + l£n+l e | eG p®(»-l) }j 

{ a n+i 7 "+i e | e e (B - 1} }, and 

{/3 n +l7 n+l e | eGF ®(»-l) } 

These subspaces are certainly pairwise disjoint, since a tensor would have to have 
degree at least 3n + 3 in order to have all three of a, and 7 as factors n + 1 times 
each. Furthermore, the subspace spanned by the first two together lies inside the 
subspace {a n+1 c | c G P® 2n }, which, by the same argument, is disjoint from the 
third. Thus, the three constraints are independent. 

We now come to the point where the analogy between the square-patch and 
triangular-patch theories breaks down, at least with my current state of knowledge. 
The derivative constraints that arose in the square-patch case implied that a square- 
patch spline should be built, not from arbitrary surfaces of degree 2n, but rather 
from bipolynomial surfaces of degree (n; n). In the triangular-patch case, it is once 
again true that the surfaces involved are not arbitrary surfaces of degree 3n + 1. 
But it isn't so clear just what they are instead. In lieu of a better name, let us call 
them triangular-patch surfaces. That is, let P be an affine parameter plane, let a, 
/?, and 7 be three vectors on P that satisfy a +#+7 = 0, and let n be a nonnegative 
integer. We shall call a polynomial surface F of degree 3n + 1 a triangular-patch 
surface if F satisfies the triangular-patch derivative constraints 

(D a ) n+1 {D fi ) n+1 F = (Z?a) n+1 (^) n+1 ^ = (Z?j) n+1 (I> 7 ) w+1 F = 0. 

Warning: Current standard nomenclature uses the term " triangular-patch sur- 
face" to refer to what we are calling simply a polynomial surface, without any 
constraints on the derivatives. Our unconventional definition has the advantage 
that triangular-patch surfaces in our sense are actually the surfaces whose patches 
appear in triangular-patch splines. 

The stage is now set for the juiciest challenge of this challenge-laden paper. 

Challenge 23*6. Find a natural way to blossom a triangular-patch surface F:P — ► 
Q of degree 3n+l, that is, a way that builds in the appropriate derivative constraints. 
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Restating this from the algorithmic perspective, suppose that we are given 3(n+ 1) 2 
control points at arbitrary places in Q. Find a way to construct the value F(u) 
by performing 3n + 1 iterated stages of either one-dimensional or two-dimensional 
linear interpolations whose ratios are determined somehow by the location of u in 
P, starting with the control points. The resulting function F(u) must satisfy the 
triangular-patch derivative constraints, but must be otherwise arbitrary. 

Remark 23.7: Integrating tripolynomial spline flows. A cube viewed along 
its main diagonal looks like a hexagon. This idea can be parlayed into a different way 
of constructing triangular-patch spline surfaces, first exploited by de Boor and Hollig 
in their box splines [7], which might be useful in solving Challenge 23.6. Let us use 
the term flow for a parametric function whose parameter space is 3-dimensional; 
that is, "flow" is the third element in the sequence that starts with "curve" and 
"surface." By exploiting the theory of tripolynomial functions, it is easy to build 
spline flows of degree 3n by assembling cubic cells cut out of tripolynomial flows of 
degree (n;n;n) in such a way as to guarantee C n ~ l continuity across the squares 
where two cubic cells abut. Let A n : U X V x W -> R denote the blending function 
that gives the influence of a particular control point in this method. The base-case 
blending function A 0 is identically equal to 1 on one cube in the 3-space U x V x W 
and is 0 everywhere else, while A n for general n is the (n + l)st convolutional 
power of A 0 . Suppose that we integrate A 0 along each line parallel to the main 
diagonal. As a function of the position of the line of integration, the graph of this 
integral will be a hexagonal pyramid, just like Bo, the base-case blending function 
in Prop. 23.4. Because convolution and integration commute, the same coincidence 
holds for all degrees n. That is, the blending function B n that we used in Prop. 23.4 
to build our triangular-patch spline surfaces, which is the convolutional power of a 
hexagonal pyramid, can be computed by taking the convolutional power A n of the 
characteristic function of a cube and integrating it along lines parallel to the main 
diagonal. This point of view might be a fruitful way to tackle Challenge 23.6. 

Until a solution to Challenge 23.6 is discovered, our only option is to study 
triangular-patch spline surfaces using the blossoms that we get by treating their 
patches as degenerate cases of polynomial surfaces of degree 3n + 1. We should 
expect the answers that we get via this poor man's approach to be rather confusing. 

To get a feeling for how bad this confusion is likely to be, suppose that we were 
studying square-patch spline surfaces, but that we hadn't invented the bipolynomial 
theory. For definiteness, consider bicubic spline surfaces with square patches, using 
the integers {t} and {j} as the knot sequences in both U and V. Let us focus on 
the region A = X [2,3] and the vertex r — (2,2) at its lower left corner, as 

shown in Fig. 23.8. The control point C(r) associated with the vertex r is given by 
C(r) := /(6,i)x(6,4}(i>^M>M), where / denotes the blossom of F as a bicubic 
spline. Without the bipolynomial theory to draw upon, we would have had to 
consider each patch of F as a degenerate hexic surface. Let gA-P* — > Q denote 
the 6-blossom of the patch F Ay while f A : U* x V s -► Q is its (3;3)-blossom, and 
let 8 := (3,2) and t := (2,3). We can express the control point C(r) as an affine 
combination of the B6zier points g% (rVt*) for t + j + k = 6 of ^(Arst); the 
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2 $ 

Fig. 23.8. The domain of one patch of a square-patch spline surface 



formula turns out to be 



C(r) = 



(+5«7®(r 4 t 2 ) -20 ff ®(r 3 8t 2 ) +10 ff ®(rVt 2 ) 
-10<?®(r 5 t) +40<7®(r 4 st) -20s®(r 3 s 2 t) 
+<7®(r 6 ) -10 ff ®(r 8 8) +5 ff ®(r<8 2 ) 



Deriving this formula is a straightforward application of the degree-adjusting formu- 
las from Part D, starting from the fact that C(r) = f A (l, 1, 2, 3). Treating g A as 
known, we can produce a formula for f A in terms of g A by first using Prop. 13.3 to 
split the degree of g A from 6 to (6; 6) and then using Eq. (11.7) twice to lower each 
of the resulting degrees from 6 to 3 separately. Note that the control point C(r) 
had a simple label in terms of f A , but its label in terms of g A is both complicated 
and mysterious. The upcoming formulas for the control points of triangular-patch 
splines are also complicated and mysterious, presumably because they are stated 
in terms of the wrong blossom, a blossom that doesn't have the proper derivative 
constraints built in. 

Fig. 23.9 shows a vertex r of the domain grid of a triangular-patch spline F and 
a region A := Arst adjacent to that vertex. In the method of Prop. 23.4, the patch 
F A is influenced by 3(n + l) 2 control points. By computing the convolutions in 
Prop. 23.4, we can determine the linear relations that give the patch F A in terms of 
its control points. By inverting those relations, we can compute the control points in 
terms of the patch. Let C(r) denote the control point associated with the vertex r. 
The case n = 0 is trivial; we have C(r) = F{r). 

For n = 1, the spline F has quartic patches. Let g A be the 4-blossom of the 
patch F A \ we are using the letter u g n rather than to remind ourselves that the 
derivative constraints on F are not built into g. The control point C(r) can be 
expressed as the following affine combination of the Bezier points of F A (&Tai); 



C(r) = g® 



-2r 2 



+2r 3 s +2r 3 t 
s 2 +2r 2 st -2r 2 t 



,) 
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Fig. 23.9, The domain of one patch of a triangular-patch spline surface 
Since F A is a triangular-patch surface, we must have 

(Z? r _.) 2 (£>r-t) 2 FA = 4!,». ((r - 8) 2 (r - t) 2 ) = 0. 

Therefore, we can get another equally valid formula by adding (r - s) 2 (r - t) 2 to 
the argument of 3®: 

/ -rV +6r 2 st -r 2 t 2 
C(r) = -2rs 2 t -2rst 2 

V +8 2 t 2 

Exercise 23.10. Continuing our study of the n = 1 case in Fig. 23.9, show that 
the following is yet another valid formula for C(r): 

C(r) = y(r,s,u,w) + j(r,t,v,x) - </(r,r,r,r). 

The function g here is the overloaded 4-blossom of the spline surface F. Note that 
the 4-blossoms of all six patches shown in Fig. 23.9 will agree on all three argument 
bags in this formula, since each bag includes two points on each of the three double 
knotlines. This formula can be turned into a geometric construction for the point 
C(r) as follows. The C 2 continuity constraint between A = Arst and Artu is 
enforced by three two-dimensional de Casteljau Diagrams with two shells. The 
central vertices of the outer shells of these three diagrams, going from t towards r, 
are the points </(t,t,s,u), </(r,t,s,u), and $(r,r,s,u). If we draw a line from the 
second of these to the third and extend it the same distance again, we end up at 
</(r,w, 8,11). The analogous constructions on three of the six edges radiating from r 
give the point ^(r,s,u,w), while the other three edges give $(r,t,v,x). These two 
points are opposite vertices in a parallelogram in Q whose other two vertices are the 
surface point F(r) and the control point C(r). 

The mysteries deepen when n = 2. If g a is the 7-blossom of F Ay the control 
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point C(r) is given by 

+57rV -99rVt -99r 4 st 2 +57r 4 t 3 \ 
-20r 3 s 4 -215r 3 s 3 t +771r 3 s 2 t 2 -215r 3 st 3 -20r 3 t 4 
+132r 2 s 4 t -321r 2 s 3 t 2 -321r 2 s 2 t 3 +132r 2 st 4 
-36rs 4 t 2 +275rs 3 t 3 -36rs 2 t 4 

-20s 4 t 3 -20s 3 t 4 / 

The solver of Challenge 23.6 will have the satisfaction of explaining why these mys- 
terious coefficients are correct. 

Exercise 23.11: Osculating a polynomial surface by a triangular-patch 
surface. What should it mean to osculate an arbitrary polynomial surface F of 
degree 3n + 1 by a triangular-patch surface G, also of degree 3n + 1, at a point 
r in Pt The triangular-patch derivative constraints tell us certain derivatives of 
G that have to be zero, and hence can't match the corresponding derivatives of F 
at r. But which of the remaining derivatives should we choose to match? Let / 
and g be the (3n + l)-blossoms of F and G. Consider the tetrahedra of blossom 
values {ff (r'a^V )} and {<j?(r»V/? V)} for i + j + ib + / = 8n + 1. The relation 
oc + /3 + 7 = 0 implies that 

/®(rV+VV) + ff (rV^+V) + /®(r^/?V +1 ) = 0 

whenever i + j + k + I = 3n, and similarly for gf. Since we want G to be a 
triangular-patch surface, we must have gf(r % a J /3 k f l ) — 0 whenever any two of the 
three indices j y k y and / are both at least n+ 1. Show that G is uniquely determined 
by the requirement that gf^a* P k f l ) = ff (r*a J '/9*V) whenever t+min(j, Jfc,/) > n. 
The surface G determined in this way seems to be the natural choice to called the 
triangular-patch surface that osculates F at r. 



2C(r) = #„ 



Part G: The Rational Case 



At the outset, we made three decisions about how we would model shapes: our 
models were to be piecewise, parametric, and polynomial. There are some situations 
in computer-aided geometric design where the restriction to polynomial curves and 
surfaces is unpleasant. For example, circles are not polynomial curves. Of course, 
they can be approximated arbitrarily well by polynomial splines; but it would be 
simpler if such basic shapes could be handled exactly. A related difficulty is that 
the class of polynomial curves is not closed under projective maps. Extending our 
models to allow rational maps instead of just polynomial ones solves both of these 
problems. In this concluding part, we will investigate how the blossoming technology 
can be extended to the rational case. 

24. On choosing scale factors 

So far, we have studied blossoming in two worlds. In the affine world, we 
blossomed a polynomial map into a multiaffine map, and then tensored to get an 
affine map. In the linear world, we blossomed a homogeneous polynomial map into 
a multilinear map, and then tensored to get a linear map. There is yet a third 
world where blossoming can be applied: the projective world, which consists of 
projective spaces and projective maps between them. Rational maps turn out to be 
the natural notion of a "map of degree n" in the projective world. By analogy, we 
expect that blossoming a rational map should produce a multiprojective map, and 
then tensoring should give a projective map. Roughly speaking, that is what does 
happen. But some subtle issues are involved, associated with choosing scale factors. 
To clarify those issues, we shall start by investigating rational functions in the more 
familiar context of maps between affine spaces. 

If P and Q are affine spaces, a map R: P — ► Q is called rational if every 
coordinate of the point iZ(u) is a rational function of the coordinates of u, that is, 
a quotient of polynomials in the variables u* where u = (u 1 , . . . , u p ). For example, 
the formula S(u) :— (u, 1/u) defines a rational parametric curve S:L — ► A in the 
plane A. In fact, the curve S is one parameterization of the standard rectangular 
hyperbola, which is given implicitly by the equation xy = 1. A rational map between 
affine spaces is undefined at an argument point u that causes the denominator of 
some coordinate of R(u) to be zero. For example, 5(0) is undefined. We don't 
permit the denominator of any coordinate of R to be identically zero; this restriction 
guarantees that R(u) will be well-defined except for u in some set of measure zero 



Computing the degree of a rational map is a bit tricky, as the example curve S 
demonstrates. A hyperbola like S is surely a curve of degree 2; but the definition 
S(u) := (u, 1/u) doesn't involve any polynomial of degree greater than 1. The 
solution for this difficulty is to make the convention that all of the coordinates of 
the point i2(u) must be put over a common denominator before we compute the 
degree. This rule forces us to rewrite the hyperbola S(u) in the equivalent form 



in P. 
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which is obviously quadratic. 

Common denominators are not unique, so we have a choice to make at this 
point. If we are trying to compute the degree of a rational map R, we should choose 
a least common denominator, that is, a common denominator of minimum degree. 
Even then, we will still have to choose a scalar factor: for example, we could equally 
well have written 

S(u) = ( , — ) 

\ wu wu/ 

for any nonzero scalar w. If we are dealing with rational maps of degree n, but 
the particular map R happens to be degenerate and have some smaller degree m, 
it makes perfect sense to adopt a common denominator whose degree exceeds the 
minimum possible by as much asn-m. For example, we could view the hyperbola 
5 as a degenerate rational cubic curve by writing it 

cfirt - / u *( au + b ) au + b \ 
K } \ u{au + b) >u{au + b)r 

where (au + b) is an arbitrary linear factor. 

Once we have made an explicit choice of a common denominator, there is no 
reason to keep separate copies of that denominator in each coordinate of R. Instead, 
it is more convenient to think of each numerator as a coordinate and the common 
denominator as one additional coordinate. Recall, from Section 5, that the affine 
object space Q sits as a hyperplane inside its linearization Q*> which consists of all 
1-tensors on Q of all flavors. Let c: P — * R denote the chosen common denominator 
for the coordinates of the rational function R:P — ► Q 9 and consider the map F:P — ► 
Q* defined by F(u) := c(u)i2(u). Since c(u) is a common denominator, the map F is 
a polynomial map. Furthermore, the rational map R is determined as the projection 
(with center at the origin) of the polynomial map F onto the 1-flavored hyperplane 
Q = Qif sitting in Q*. For example, if we choose the common denominator u for 
the quadratic rational curve S(u) := (u, 1/u) in the plane A, the corresponding 
quadratic polynomial curve G: L — ► A* is the parabola given by G(u) := (u 2 , 1; u). 
Projecting the parabola G down into the plane A in A* gives the hyperbola 5. 
Alternatively, if we decide on u(au+b) as the common denominator for 5, we would 
get the cubic polynomial curve H(u) := {u 2 (au + b),(au + b); u(au + b)), which also 
projects down onto S. 

Proposition 24.1. Let P and Q be affine spaces. The equation F(u) = c(u)J2(u) 
provides a one-to-one correspondence between polynomial maps F;P — ♦ Q* whose 
flavor coordinates are not identically zero and pairs (R,c), where R:P — ► Q is a 
rational map and cz P — ► R is a common denominator for the coordinates of R. 

Proof Suppose that we start with a pair (R>c). We already noted that the formula 
F(u) := c(u)i2(u) will define a polynomial map F:P —> Q*, since c(u) is a common 
denominator for all of the coordinates of i?(u). Furthermore, the flavor coordinate 
of this map F is given by Flav(i ?, (u)) = c(u) and is hence nonzero, since the zero 
polynomial is not a legitimate common denominator. 
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Conversely, given a polynomial map P, let c(ti) := Flav(P(u)) be the flavor 
coordinate of P, which, by assumption, is not the zero polynomial. For all points 
u with c(u) ^ 0, the formula R(u) := P(u)/c(u) defines the value R(xx) to be a 
1-flavored 1-tensor on Q } and hence a point in Q. Since each coordinate of R is 
the quotient of a coordinate of F by c(u), the function R is rational, with c as a 
common denominator for its coordinates. □ 

The stipulation, in Prop. 24.1, that the flavor coordinate of the polynomial 
map F must not be identically zero arises because we have been considering rational 
maps, so far, as maps between affine spaces. We can eliminate this restriction by 
converting to the projective world. That conversion will also reduce the frequency of 
cases where a value of a rational function is undefined, although it won't eliminate 
the problem of undefined values completely. We proceed as follows. 

In Section 5, we saw that the Homogenizing Principle could be used to convert 
a polynomial map M: P — ► Q of degree n into a flavor-exponentiating homoge- 
neous polynomial map M*:P* — ► also of degree n. More generally, suppose 
that we apply the Homogenizing Principle to the polynomial map F:P — * Q* 
obtained from a rational map by Prop. 24.1, whose values P(u) are not neces- 
sarily 1-flavored. The only effect of dropping the flavor constraint on F is that 
we lose the corresponding flavor constraint on P*. That is, the homogenized map 
P*:P* -* Q* will be an arbitrary homogeneous polynomial map of degree n, not 
necessarily flavor-exponentiating. In the hyperbola example, the homogenized form 
of G(u) = (u 2 ,l;u) is G*({u;v)) := (u 2 ,v 2 ;uv). The homogenized form P*, just 
like P itself, depends on the choice of common denominator. If we chose u(ou + 6) 
as the common denominator for the hyperbola, we would get the cubic homogenized 
form 2f*((u;t/)) := {u 2 (au + bv) y v 2 (au + bv)]uv(au + bv)). 

The domain P* and codomain Q* of the homogenized map P* are both linear 
spaces. As we discussed in Section 9, the basic idea of projective geometry is to 
treat the lines through the origin of a linear space as the "points" of a new space, 
called a projective space. If e is a nonzero tensor in a linearized space P*, we shall 
use the expression [e] := {we | w G R} to denote the line through the origin of 
P* that contains c; the expression [0] is not defined. For example, the expression 
[(u;v)] denotes a particular line through the origin of L*, and hence a "point" in 
the projective line. For notational convenience, we shall omit the angle brackets 
in the expression [e] when the 1-tensor e is given in explicit coordinates, writing 
simply [u;v], for example, to denote that point in the projective line. (From now 
on, we shall also elide the quotation marks around the word "point" when referring 
to points in projective spaces.) The numbers inside these square brackets are called 
homogeneous coordinates. They are defined only up to a common scalar multiple; 
that is, it is the ratios between different homogeneous coordinates that are well- 
defined, not the values of the coordinates individually. 

Since the map P* is homogeneous, it carries a line [e] through the origin of 
P* into the line [P*(e)] through the origin of Q*. Let F\*\ denote the set of all 
lines through the origin of P*, which constitutes a projective space of dimension 
p = dim(P), and similarly for We can define a map P[*j:-FUi — ► Q[*\ by 

the equation P[*]([e]) := (P*(e)] whenever the right-hand side is defined, that is, 
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whenever i**(e) ^ 0. The map F[*\ is the projective analog of the rational map R; it 
is a rational map between projective spaces, according to our upcoming definition. 
Note that, since we ignore scalar multiples in the domain and codomain when we 
build the map F\*\ is independent of the choice of common denominator, that 
is, of the scale factor that is built into F*. The hyperbola in our continuing example, 
when viewed as a quadratic rational map from the projective line to the projective 
plane, is given by G[*i([u; v]) = [u 2 , v 2 ; uv]. Note that the map given by 
f]) ~ [u 2 (<uH-6v), v 2 (au+bv)\uv(au+bv)] is the same map as although 
G* and H* are differently scaled. 

If e ^ 0 but F*(e) = 0, then the line [e] is collapsed by F* into the origin of <?*. 
This is a more complete form of undefinedness than the ones we have considered so 
far. It corresponds to a value of the argument u that makes all of the numerators 
and the common denominator of -R(u) simultaneously zero. Such cases will remain 
undefined even in the projective world. The advantage of the projective world is 
that only those cases have to be left undefined. By contrast, a rational map between 
affine spaces is undefined whenever the common denominator is zero, regardless of 
the values of the numerators. 

To clarify the different types of undefinedness, consider the hyperbola example. 
In the affine world, no matter what common denominator we choose, the hyperbola 
5(u) := (u, 1/u) will be undefined when u = 0. If we choose a common denominator 
of more than the minimum degree, we may add more instances of undefinedness. 
For example, using the denominator u(au + b) would make S(~b/a) undefined. In 
the projective world, it is also the case that the frequency of undefinedness depends 
upon the scale factor we choose. If we define G(*j([u; v]) := [u 2 , v 2 ; uv] by using the 
homogeneous quadratic map G*({u; v)) := (u 2 , v 2 ; uv) that results from the common 
denominator u, then G|*] will be defined everywhere. The relation G[*]([0;1]) = 
[0,1; 0] tells us that the value 5(0), viewed projectively, is the point at infinity in 
the y direction. The relation <?[*}([!; 0]) = [1,0; 0] tells us that the projective form 
of the hyperbola also passes through the point at infinity in the x direction, a point 
we might refer to as 5(oo). Suppose, on the other hand, that we use the common 
denominator u(au+6). We then get the function iT[*i([u; v\) ~ [u 2 (au+6v), v 2 (au+ 
bv)\ uv(au+ bv)] t which agrees with G[*j almost everywhere. The value H[*\([— 6; a]), 
however, is undefined; by using a scale factor of higher degree than necessary, we 
have introduced an undefined case. 

Even in the projective world, we can't get rid of undefinedness completely — at 
least, not for surfaces. Rational curves are a special case. The coordinates of a 
rational curve are homogeneous bivariate polynomials, which behave essentially like 
nonhomogeneous univariate polynomials. If a collection of such polynomials have a 
common root, then they, in fact, have a common factor. If we remove all common 
factors by choosing a polynomial form F* of minimum degree, the resulting rational 
curve will be defined everywhere. But the coordinates of a rational surface are 
homogeneous trivariate polynomials, which behave like nonhomogeneous bivariate 
polynomials. The fact that a collection of such polynomials have a common root 
does not come close to implying that they have a common factor. For example, the 
polynomials u and t; have the common root u = v = 0, but they have no common 
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factor. As a result, the rational map of degree one from the projective plane to 
the projective line given by [u, v\ w] h-> [u; v] is unavoidably undefined at the origin 
[0,0; 1]. 

Let F\*\ and Qw be projective spaces, and let F[*):F\*\ — ► be a function 
defined almost everywhere. We shall call a nonzero homogeneous polynomial map 
P*:P* — ► Q* a scaled form of P[*j if the identity P|*i([e]) = [**( e )l holds almost 
everywhere. A map F\*\ between projective spaces is called a rational map if some 
scaled form P* of P[*| exists. The degree of the rational map P[*], written deg(P[*]), 
is the minimum n with the property that F\*\ has a scaled form P* of degree n. The 
analog of Prop. 24.1 in the projective world is the following easy proposition. 

Proposition 24.2. Let F\*\ and Q[*\ be projective spaces. Every nonzero homo- 
geneous polynomial map P*:P* — > Q* is a scaled form of a unique rational map 
F[*\:F\*\ — ► Q[*\, which can be determined by the identity P[*]([e]) = [P*(e)]. On 
the other hand, every rational map P[*j has an infinite number of scaled forms. If 
we pick one scaled form P* of minimum degree as the reference, a nonzero homo- 
geneous polynomial map G*: P* — * <?* will be a scaled form of F\*\ if and only if 
G*(e) = W(e)F*(e) for some homogeneous polynomial W:P* — ► R. In particular, 
the various scaled forms of minimum degree are scalar multiples of each other, and 
they can be distinguished from scaled forms of more than the minimum degree by 
the property that their coordinate polynomials have no nontrivial common factor. □ 

The easiest way to handle a rational map P[*j is to choose an explicit scaled 
form P* right at the outset. We can apply the linear-world variants of blossoming 
and tensoring to this scaled form, with the results shown in the right-hand column 
of Fig. 24.3: blossoming converts the scaled form P* into a multilinear map /*, and 
tensoring then gives us a linear map /®. Restricting these three maps to arguments 
that are 1-fiavored gives us the maps in the middle column of Fig. 24.3: a polynomial 
map P, its multiaffine blossom /*, and its affine blossom ff. These six maps are the 
six different scaled guises of a rational map. Note that the middle and right-hand 
columns in Fig. 24.3 are almost identical to all of Fig. 5.1. The middle column of 
Fig. 24.3 differs from the left-hand column of Fig. 5.1 only in that the codomains are 
Q* instead of Q in each case. The right-hand column of Fig. 24.3 differs from the 
right-hand column of Fig. 5.1 only in the lack of the constraints that the functions 
in each box must carry 1-flavored arguments to 1-flavored results. The differences 
between Fig. 24.3 and Fig. 5.1 form a consistent set, in the sense that the six scaled 
guises of a rational map are in one-to-one correspondence: given a map of any one 
of the six types, unique maps of the other five types exist that correspond. 

The multiple dividing lines in the left half of Fig. 24.3 are warnings that some of 
those correspondences are not one-to-one. As described in Prop. 24.2, moving from 
a rationed map P[*j in the upper-left box to some scaled guise in the middle or right- 
hand column involves choosing a polynomial scale factor of degree n — deg(P[*i). 
Even when deg(P(*j) = n, there is a scalar scale factor to choose; the second thin 
vertical line represents the choice of that scalar. The thick line separating the upper- 
left box from all the rest represents the choice of the scaling polynomial modulo its 
scalar part, which is a nontrivial choice only when deg(P[*]) < n. 
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Fig. 24.3. The nine guises of a rational map 

The lower-left box in Fig. 24.3 is the easier of the remaining two to deal with. 
If F\*\ and Q\*\ are projective spaces, a map H\*\:P[*\ — ► g ( *] is called projective 
if it has a linear scaled form #*:P* — ► g*. That is, projective maps are precisely 
rational maps of degree one. Ignoring scalar multiples in the domain and codomain 
of the linear blossom ff :Pf n — ► g* gives us the projective blossom, which is a 
projective map f { % : P ( ® n Q w . The second thin line separating the lower-left box 
from the six scaled guises warns that choosing a particular linear scaled form of the 
projective blossom involves choosing a scalar scale factor. 

To handle the remaining, middle-left box in Fig. 24.3, we must define the con- 
cept of a multiprojective map. For our current purposes, the appropriate definition 
is as follows: a map f M : (F\*)) n -* Q w is multiprojective if it has a multilinear scaled 
form, that is, if a multilinear map /*: (P*) n -+ g* exists that satisfies the identity 
/[*l([ c i]> • • - , [e n ]) = [/*(ci, - • . , e n )] for almost all n-tuples (e u . . . , e n ). (Section 26 
considers a weaker notion of multiprojectivity and studies how that notion relates 
to the notion defined here.) To convert a multilinear blossom into a multiprojective 
blossom, we ignore scalar multiples in all of the domain spaces and in the codomain 
space. Choosing a particular multilinear scaled form for a multiprojective blossom 
involves choosing a scalar scale factor; hence, there is a second thin line separating 
the multiprojective blossom from the six scaled guises. 

Applying blossoming and tensoring in the projective world corresponds to mov- 
ing down the left-hand column of Fig. 24.3. Tensoring is a one-to-one correspon- 
dence in the projective world, because the same scalar scale factor that we must 
choose when going from the multiprojective box to the multilinear box is washed 
out again when we move from the linear box to the projective box. But blossoming 
is not a one-to-one correspondence in the projective world. Moving from rational 
to homogeneous-polynomial involves choosing a polynomial scale factor of degree 
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n - deg( J F[*]). Only the scalar part of this polynomial factor gets washed out again 
when we return from multilinear to multiprojective; the thick line separating the 
rational box from the multiprojective one represents the remaining choice. 

Warning 24.4, The multiprojective n-blossom of a rational map between pro- 
jective spaces is not unique when deg(F[*i) < n. 

As an example of Warning 24.4, take the identity function G|*i: -* on 
the projective line which is a rational map with deg(G[*]) = 1, and consider it 
as a degenerate case of a quadratic rational curve. Quadratic scaled forms G* for 

have the form G*((u;v» := {u(au + bv)]v(au + bv)) for scalars a and 6, not 
both zero. The bilinear blossom j* of G* is given by 

/ UjVj + 1*2^1 U1V2+U2V1 \ 

0*((u i; vi>, (u 2 ; v 2 » := lau^ + b - ; a - + bv x v 2 J . 

When we project this down to get a biprojective blossom 01*1 for G[*i, we find that 



0[*i([ui;vi],[u2;v2]) := 



au\U2 



t u x v 2 + u 2 v x u x v 2 + u 2 v x . 
+ b ; a + bv x v 2 



On the diagonal, we have Si*]([u; v],[u; t/]) = [u(au + bv)\v(au + bv)) = [u;v], in- 
dependent of the scalars a and 6. But the off-diagonal values of the blossom g\*\ 
do depend on the choices of a and b. This phenomenon may seem less mysterious 
when considered in the affine world, where v = v x = v 2 = 1, and the corresponding 
formulas are 

, x u(au + b) au 2 + 6u 

G W («) = «= au + b 



aui« 2 + 6(tii + «2)/2 
yi * ,(ui,U2) = a(u x + u 2 )/2 + b ■ 

Note that yi*)(6,l) = 6/(a + 26). Thus, the off-diagonal values of the biprojective 
blossom gi*\ do depend on the ratio a : 6, which is the part of the polynomial scale 
factor au + bv that remains when scalar factors are ignored. 



25. Controlling and joining rational curves 

In brief, the moral of Section 24 was the following: the way to draw a rational 
curve or surface in an affine space Q is to draw a polynomial curve or surface in 
the linearization Q* and then to project down onto the hyperplane Q = Q%. In this 
section, we shall investigate a few of the consequences of this two-step approach. 

To begin with, suppose that we want to draw a segment of a rational curve 
p w: i w q m 0 f degree n. One of the six guises of any scaled form F* of is 
an n-ic polynomial map F: L — ► Q*. We shall focus on that guise, since a polynomial 
curve of degree n is such a familiar object. 

The obvious way to control the scaled form F is to specify its Bezier points with 
respect to some reference interval [5, i] for L. The name Bezier tensor is probably 
better in this context, since a Bezier point of F is a point in Q*, that is, a 1-tensor 
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on Q. When specifying a Bezier tensor e, it is common practice to adopt a rather 
unusual, non-Cartesian coordinate system for the space <2*. Instead of specifying 
the coordinates of e directly, we specify the coordinates of the point x in Q to which 
e projects, which is called a Bezier point of the rational curve F[*) 7 and we separately 
specify the flavor w := Flav(e) of the Bezier tensor e, which, in this context, is called 
its weight For example, suppose that Q is a plane. We shall specify a Bezier tensor 
e in the 3-space Q* by giving x, y, and tv where e = (wx, wy\ it;); that is, the tensor e 
projects to the Bezier point x = <x, y) in Q, and the weight of e is w. The advantage 
of this approach is that the designer generally prefers to think about the geometry 
of Q rather than the geometry of Q*, and hence the projected coordinates x and 
y are more relevant than the unprojected coordinates wx and wy. One difficulty 
with using x, y, and w as coordinates on Q* is that tensors in Q Qy that is, vectors 
on Q, don't have well-defined coordinates. For example, there are no values of x, 
y, and w that will make (wx,wy;w) = <1,3;0). This is not a crippling problem in 
practice, however, since the weights of Bezier tensors are usually restricted to be 
positive numbers. 

Thus, we specify the rational curve segment ^[*i([f,t]) by giving its Bezier 
points and their associated weights, which determines the Bezier tensors of the 
scaled polynomial segment From a blossoming point of view, the Bezier 

tensors c t are the values a := /®(f n ~*i*) in Q* for t in [0,n], and their projections 
in Q (more precisely, in Q w ) are the Bezier points x» := /®](s n ~*f *). The weight 
Wi of the tth Bezier point is the scalar by which that point must be multiplied in 
order to give the tth Bezier tensor: e< = w t x t . The rational curve F[*\ is given, after 
projection into Q> by 



0<i<n 

and its multiprojective blossom /[*] is given by 

e n(S) n(^)«v,x m 

f /- _ x /UJ = {l,... ( n} 



£ n(S)n(^)«v, ' 

/uJ=(l t ...,n} 

Going from a rational curve to its Bezier tensors involves choosing a scale factor, 
as we discussed in Section 24. The scalar scale factor represented by the second thin 
vertical line in Fig. 24.3 manifests itself in the fact that we can multiply all of 
the weights Wi of all of the Bezier tensors by a common scalar without affecting 
the rational curve F w . The polynomial scale factor represented by the thick black 
line around the rational box in Fig. 24.3 arises only when a degeneracy of degree 
is involved. When it arises, however, it affects the positions of the Bezier points 
as well as their associated weights. This phenomenon is precisely the import of 
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Warning 24.4, and the example given their works here as well; perhaps that example 
is worth reviewing. Suppose that we want to describe the x-axis of the plane Q under 
the trivial parameterization F\*)(u) = (u,0) as a degenerate case of a quadratic 
rationed curve, and suppose that we choose the standard reference interval [0, 1] 
for the parameter line L. The first and last Bezier points of jF[*i([0,I]) are clearly 
determined: /(*i(0,0) = F w (0) = (0,0) and /,*,(!, 1) = F w (l) = (1,0). But the 
location of the middle Bezier point along the x-axis and the weights of the three 
Bezier tensors depend upon the common denominator (au + b) in the formula 



, x /u(au + b) 0 \ 



Choosing the denominator (au + 6) results in the scaled parabola F(u) — {au 2 + 
6u,0;au + b) in Q*, whose biaffine blossom is 

/«,„«,) = + 6 + . 0; . (S±«l) + ») . 

Thus, the middle Bezier point will be located at /i*i(6, 1) = (b/{a+ 26), 0), and the 
weights of the three Bezier tensors will be 6, 6 + a/2, and b + a. 

There is another type of redundancy involved in the choice of the weights of the 
Bezier tensors, if the details of the parameterization of the curve segment F[*\ ([s, t ]) 
don't matter to us. Note that there is a one-parameter family of projective maps 
<p: — ► from the projective line to itself, that fix the two points s and t. 
In particular, the map 

(j=g)-+(?=flg 

* u) (|E5).+ (SEf)» 
fixes 8 and i while taking the midpoint (s + i)/2 to the point (as + bt)/(a + b) 9 
which can be anywhere in the interval (M), depending upon the ratio a : 6. We can 
think of the numbers a and b as the weights of the endpoints s and t of the reference 
interval, viewed as tensors in L* rather than as points in L. If F\*\ is a rational 
n-ic, then the composite map u F\*\(<p{u)) will also be a rational n-ic, and 
the segment ([$,?]) will be the same as ([*,*]) except for the details of the 
interior parameterization. The Bezier points of will be the same as those of 
^1*1 » with the weights readjusted as follows: 

? ®(5 n --*P) = a n - i 67®(5 n -*P). 

There wets no phenomenon analogous to this redundancy in the polynomial case, 
since the only affine map from the line L to itself that preserves the endpoints of 
the reference interval [l, i] is the identity. 

The case of rational quadratic curves deserves some special attention, since 
the rational quadratics are precisely the well-known conic sections. A segment of a 
rational quadratic has three Bezier points, say x, y , and z, each with a corresponding 
weight, say u, v, and w. Scaling all three weights by the same factor doesn't change 
anything. More generally, replacing the three weights by a 2 u, a&v, and b 2 v for a ^ b 
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changes only the parameterization of the conic segment. Thus, the sole effect of 
the weights on the shape of the segment is contained in the ratio t; : y/uw. For 
simplicity, let us assume that the weights are all positive (or all negative, it doesn't 
matter). If v 2 = uw, the conic segment is the parabolic arc whose Bezier points, in 
the polynomial sense, are x, y, and z. If v 2 > uw y we have an arc of a hyperbola; 
if v 2 < utu, less than half of an ellipse. This behavior helps to justify the name 
"weight" for the Savor of a Bezier tensor, since increasing the weight t; associated 
with the apex y of the control triangle Axyz pulls the conic segment up towards 
that apex. In fact, let m denote the midpoint of the base xz of the control triangle. 
The parameter v/y/uw is projectively related to the point r where the conic segment 
cuts the median line ym by the formula 

v r - m 

y/uw y — r * 

If u and w have the same sign, but t; has the opposite sign, we get the complements of 
the segments above: a parabolic arc that includes the point at infinity, a hyperbolic 
arc including both points at infinity, or more than half of an ellipse. When ti and w 
have opposite signs, the endpoints x and z are on different branches of a hyperbola 
(and the conic does not intersect the median line, over the real numbers). 

Given a fixed control triangle, it makes an interesting geometric problem of the 
old school to compute the value of the parameter v/y/uw that gives the ellipse of 
minimum eccentricity, as described in the following pair of exercises. Of course, ec- 
centricity is not even an affine invariant, so both exercises end up having to compute 
a ratio between distances in different directions. 

Exercise 25.1. Let D be a diameter of an ellipse, and consider the chords of the 
ellipse that are parallel to D. Show that the locus of the midpoints of these chords 
is another diameter D' of the ellipse, and show that the chords parallel to D 1 have 
their midpoints on D. The diameters D and D* are called conjugate. The major 
and minor axes of an ellipse form one pair of conjugate diameters, whose lengths are 
unequal. Show that each ellipse has a unique pair of conjugate diameters that are 
equal in length; the axes of the ellipse bisect the angles between this pair of conjugate 
diameters. Let Axyz be a control triangle for a conic segment, and let m be the 
midpoint of the base xz. Show that every ellipse tangent to xy at x and tangent to 
yz at z has a pair of conjugate diameters with one parallel to the base xz and the 
other lying along the median line ym. Finally, show that the ellipse of this form 
with minumum eccentricity is the one in which those two conjugate diameters have 
equal length. See Fig. 25.2. Hint: Problem 54 in Heinrich Dome's book 100 Great 
Problems of Elementary Mathematics [20] considers the more general situation in 
which the ellipses are constrained to circumscribe a quadrilateral. In our case, the 
quadrilateral is degenerate: its four vertices, in order, are x, x, z, and z, while y is 
the point of intersection of its two opposite zero-length sides. 

Exercise 25.3. Algebra and geometry are both paths to the truth; show that the 
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Fig. 25.2. The ellipse of minimum eccentricity in the control triangle Axyz 

ellipse of minimum eccentricity is characterized algebraically by the formula 

llz-xl| 2 
uw 2(||y-x|| 2 + ||z-y|| 2 )' 

A rational curve segment jF[*j([»,t]) of degree n for n > 3 has n + 1 Bezier 
tensors, each with its own weight. Multiplying all of the weights by a common 
scalar has no effect. Multiplying the weights by a sequence of scalars in geometric 
progression changes the parameterization, but not the shape. The remaining n — 1 
degrees of freedom in the weights actually give different shapes for *i*i([M ]). 

The convex-hull property carries over to the rational case, as long as all of the 
weights are positive. If we think of the scaled form ^([M]) as a polynomial curve 
in Q*, the points F(u) for u in [s y i] must remain in the convex hull of the Bezier 
tensors in Q*. If all of the weights are positive, the projection in Q of this convex 
hull will be the same as the convex hull in Q of the Bezier points, which are the 
projections of the Bezier tensors. 

The weight-based approach generalizes to rational surfaces of degree n or to 
birational surfaces of degree (m; n) with no difficulty. We just draw a polynomial 
or bipolynomial surface in the linearized space by using weights to turn Bezier 
points into Bezier tensors, and then project down into the actual object space Q. 
For example, a quadratic rational surface has six Bezier tensors with respect to 
any reference triangle Arst for its parameter plane. The six weight coordinates 
correspond to one irrelevant uniform scalar factor, two degrees of freedom in the 
choice of parameterization, and three degrees of freedom in the shape of the surface. 
Again, the convex-hull property will continue to hold, as long as all of the weights 
have the same sign. 

In most cases, the weight-based approach is an effective compromise between 
thinking of the affine geometry of the polynomial scaled form F in versus thinking 
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of the projective geometry of the rational map F\*\ in Q C Qy*\. The projection 
operation that transforms the former into the latter is simple enough that much of 
the structure that we analyzed in the polynomial case carries over to the rational 
case essentially intact. For example, since the projection of an osculating flat is 
the same as the flat that osculates the projection, we can apply the differential 
perspective on the meaning of a blossom's n arguments to compute a value of the 
multiprojective blossom of a rational map by intersecting osculating flats. 

Despite the power of the weight-based approach, some designers are so wedded 
to the geometry of Q rather than that of Q* that they aren't comfortable dealing 
with the flavor components of the Bezier tensors directly, even if those components 
are referred to as weights. Gerald Farin suggested an alternative approach for 
the case of curves [22]. Suppose that the designer begins by specifying all of the 
Bezier points x $ := f^\(s n ~ % i % ), but none of the weights. At this point, we know 
x o = /(®(5 n )) and we know Xi = /®j(s n_1 *). The line connecting x 0 and x x 
in Q\*) must consist of all points of the form /®](s n ~ 1 w) for u in L{*\ , and the 
correspondence u / ( ® (s"" 1 *!) must be a projective map. But we don't yet have 
enough information to compute the value f^}(s n ~ l u) for any ti other than s and f. 
One way to supply the missing information is to specify the ratio wq : w\ } where 
wo and w\ are the weights of the Bezier tensors €q = wqjCo and €\ = wiXi. Farin 's 
alternative approach is to specify the blossom value n ~ l u) for some third 

chosen value q of u, perhaps q := (i + i )/2. Once three values of the projective map 
ii /®j(5 n " 1 u) are known, all other values can be computed by exploiting the fact 
that projective maps preserve cross ratios; equivalently, the ratio wq : w\ can be 
computed. In a similar way, Farin suggests that the designer specify a third point 
somewhere along the tth edge of the control polygon to be the value f®)(s n ~ % q t* -1 ), 
for i in [l,n]. Sliding this point along the edge between x,_i = /(fj(« n "*f*) and 
x * = /(*i(* n ~*~ 1 f i+1 ) corresponds to adjusting the ratio Wi/wi-i over the interval 
(0,oo). Once all the ratios u/»_i : u\- are known for i in [l,n], then all the weights 
are known up to a uniform constant multiple, which doesn't matter. 

It is interesting to observe that Farin 's technique does not generalize gracefully 
to surfaces. Let F{*) be a quadratic rational surface in Q; let F 7 with values in 
Q*, be a quadratic polynomial scaled form of F\*\\ let u/(u,v) := Flav(/(u,v)) be 
the biaffine blossom of the weight coordinate of the scaled form F; and let Arst 
be a reference triangle in the parameter plane P. Consider the three Bezier points 
/(*l(r,r), /[*)(r,s), and /[*](r,t). Knowing those three points as points in Q is 
enough to determine the plane {/(*i(r,u) | u G fl*]} as a whole, but we need 
extra information to compute any particular point /i*j(r,u), unless u G {r,s,t}. 
In the weight-based approach, that extra information comes from the ratios of the 
three corresponding weights u>(r,r), w(r,s), and w(r,t). By analogy with Farin 's 
technique for curves, the designer could also supply that information by specifying 
the location of /[*i(r, q), where q is a fixed point in Arst, perhaps q := (r+s-f t)/3. 
But we get into trouble if try to apply this same trick to all three of the triangles that 
account for the first stage of the de Casteljau Algorithm. Specifying the location 
of /(*j(r,q) determines, among other things, the ratio u/(r,s) : u/(r,t); specifying 
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Fig. 25.4. A two-segment conic spline curve 



the location of /|*](s,q) determines the ratio w(r,s) : tu(s,t); and specifying the 
location of /[*](t,q) determines w(r,t) : w(s,t). Hence, the locations of the three 
points /i*](r,q), /i*](s,q), and /[*i(t,q) are not independent. Once two of those 
three points are chosen, the third is constrained to lie along a line, not just in a 
plane. 

There is one important area where the existence of the projection that takes 
a polynomial thing into a rational thing makes life more complicated, by opening 
up new possibilities: continuity conditions for rational splines. When deciding how 
to assemble pieces of polynomial maps to form a polynomial spline, we considered 
two different types of continuity constraints: parametric and geometric. Geomet- 
ric continuity was a more appropriate constraint, in many cases. But parametric 
continuity was technically easier, since it interacted very neatly with blossoming. 
In the rational case, there are four different types of continuity constraints, since 
we can demand either parametric or geometric continuity either before or after the 
projection has happened. 

Let's consider a simple case: assembling a two-segment conic spline in a plane 
A. Suppose that, for u in FT,0], we have decided to follow the unit circle with its 
standard rational parameterization. That is, in the 3-space A*, we shall follow the 
parabolic arc F(Fl,6]), where F{u) := (1 - u 2 ,2u; 1 + u 2 >. The biaffine blossom 
/ of F is given by /(u, v) := <1 - uv,u + v; 1 + uv). When_Fand / are projected 
down from Q* into Q, they become the quarter-circle F[*\ ([-1, 0]) shown in Fig. 25.4 
and its biprojective blossom_/i*j. The three Bezier points of this quarter-circle are 
= (0,-1), /[*](-l,0) = (1,-1), and /|*](0,6) = (1,0), and the associ- 
ated weights are 2, 1, and 1. (Plugging these values into the result of Ex. 25.3 shows 
that a circle, with eccentricity zero, is indeed an ellipse of minimum eccentricity.) 
Our task is to determine what conic segments G[*i([0, 1]) can follow l*i*]([-l,0]). 

To keep things simple, let us demand that the curves F and G join with C 1 
continuity at u = 0, even before they are projected from A* into A. This require- 
ment fixes the first two Bezier tensors of at the values y(0,0) := (1,0; 1) 
and ff(0,l) := (1,1; 1). We shall consider what various second-order continuity 
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conditions imply about the value of the third Bezier tensor j(I, 1). 

The strictest choice is to demand C 2 continuity before projection, which forces 
tjie parabolas F and G to be equal, giving 0(1,1) = /(I, I) = (0,2; 2). After 
projection, the arc <7[*]([6, 1]) simply continues around the next quarter of the unit 
circle. In this case, the joint between F and G in A* is surrounded by a de Casteljau 
Diagram with two shells. This diagram gets distorted by the projection, however, 
since the center vertex /(-1,I) = flf(-l, 1) = (2,0;0) of the outer shell happens to 
be a vector on A, and hence projects to a point at infinity in A\*\ . 

One possible weaker notion would be to demand G 2 continuity before projec- 
tion. From the theory in Section 20, we can calculate that this leads to the constraint 
£(1,1) = (0,2+ a; 2), where the scalar a denotes the shape parameter fa at the joint. 
In A*, the joint between F and G is guarded by an A-frame with A = 2/(a + 2), 
whose apex is located at ((a + 4)/(a + 2), -a/(a + 2); a/(a + 2)). When such a G is 
projected down into A, the result is an arc of an ellipse that ends on the y-axis at 
the point (0,1 + a/2). 

We get weaker continuity constraints if we ask for continuity only after pro- 
jection, instead of before. Suppose that we want the curves F\*\ and G[*], after 
projection, to join with C 2 continuity. That is, we will allow the planes containing 
F and G in A* to be tilted with respect to each other, hence precluding any chance 
of second-order continuity before projection, as long as that tilt is washed out by the 
projection down into A. A little algebra reveals that this constrains the third Bezier 
tensor be of the form <?(1, 1) = (6,2; 2 + 6), where 6 is a parameter that measures 
the amount of tilt. 

The weakest of the four notions is to demand G 2 continuity after projection, 
which is just curvature continuity of the resulting conic spline. In this case, the third 
Bezier tensor y(1, 1) := (6,2 + a; 2+6) is affected both by the shape parameter a and 
by the tilt parameter 6. After projection, the resulting conic segment G[*|([0, 1]) will 
end at the point ffi*](I, I) = (6/(2 + 6), (2 + a)/(2 + 6)), which could be anywhere 
in the plane. Choosing 6 in the range (— l,oo) gives arcs of ellipses that end with 
x in (-1,1). Choosing 6 = -1 gives arcs of parabolas that end with x = -1; the 
example drawn in Fig. 25.4 happens to have 6 = — 1. Choosing 6 in (—2, -1) gives 
arcs of hyperbolas that end with x in (— oo, — 1). Choosing 6 in (— oo,— 2) causes 
the points 0[*j(6,O) and £[*i(I,I) to be on opposite branches of a hyperbola, since 
the third Bezier tensor £(1,1) then has a negative weight. 

Whenever the tilt parameter 6 is nonzero, there is no A-frame in the space A*, 
since the tilt causes the tangent lines Osci ^(^1) and Osci G(l) to be skew. The 
corresponding lines after projection, which are Osci F[*]( =: T) and OsciG[*i(I), do 
intersect, giving something that looks rather like an A-frame except that its apex 
has two different weights associated with it. 

The expedient approach to drawing a rational spline curve or a rational spline 
surface in a space Q is to draw a polynomial spline curve or spline surface with 
C k continuity in <2*, and then project it. But this approach is somewhat sloppy. 
Even if the parameterization of the resulting rational spline is important to us, 
C k continuity after projection would presumably be sufficient, rather than before 
projection. If we are simply modeling a smooth shape, then G k continuity after 
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projection should be enough. The problem with these weaker notions of continuity 
is that they don't interact well with blossoming. 

Challenge 25.5. Use the differential perspective on the meaning of a blossom's n 
arguments to define the blossom of a rational spline curve that has G k continuity 
after projection. In particular, if h: x — ► A|*j is the blossom of the two- 
segment conic spline in Fig. 25.4, the point in A[*] that forms the apex of the 
apparent A-frame should be labeled ^(^T, 1), even if a nonzero tilt causes that point 
to have two different weights. The resulting blossoms will fail to be multiprojective 
for two different reasons: nonzero shape parameters, as in Challenge 20.6, and 
nonzero tilts. Construct an enlightening theory of such blossoms. 

Challenge 25.6. Develop useful techniques for controlling rational spline curves 
and surfaces whose joints are G k continuous after projection. 

26. Multiprojectivity in the weak sense 

In Section 24, we defined multiprojectivity by saying that a map fi*\:(Pi*]) n -* 
<J[*] is multiprojective if it has a multilinear scaled form, that is, if there exists a mul- 
tilinear map /*: (P*) n <?* with the property that the identity /i*]([ei], . . . , [e w ]) = 
[/*(e x , . . . , e n )] holds for almost all n-tuples (e x , . . . , e n ) in (P*) n . The analogy be- 
tween the words "multiprojective" and either "multilinear" or "multiaffine" suggests 
a definition along the following lines instead. A map /[*) should be multiprojective 
if the value . . . , [c n ]) is a projective function of each argument whenever all 

of the other arguments are held fixed at arbitrary values. For example, for any fixed 
values of [e 3 ] through [e n ], the map [ci] ■ . . , [e n ]) should be projective, 

meaning that it should have a linear scaled form: a linear map g: P* — ► Q* that 
makes the identity /i*i([e x ], . . . , [e n ]) = [ff(ci)] hold for almost all ci- The defini- 
tion suggested by this analogy is more liberal than the definition that we gave in 
Section 24, because the scaled form g is allowed to depend in an arbitrary way on 
the fixed arguments [e 2 ] through [e n ]. In this section, we shall explore this weaker 
notion of multiprojectivity, contrasting it with the stronger notion of Section 24. 
Multiprojectivity in the strong sense is the interesting concept from the point of 
view of practical applications in computer-aided geometric design. But there is in- 
triguing mathematics involved in exploring the weak sense. The exploration of this 
mathematics was joint work with my colleague James B. Saxe. 

We shall concentrate on the simplest case of multiprojectivity: given a function 
/l*]: X — ► whose two arguments and result value all lie in the projective 
line we shall ask whether /{*] is biprojective in either the weak or the strong 
sense. For brevity, let us refer to a function, like from the cross product of the 
projective line with itself to the projective line, as a pairing. If /t*](x,y) is a pairing, 
the maps x /i*](x,y) for fixed y and y t-* /i*i(x,y) for fixed x are called sections 
of /[*]. The basic intuitions behind the two types of biprojectivity are as follows: a 
pairing is weakly biprojective if all of its sections are projective, that is, have linear 
scaled forms; a pairing is strongly biprojective if it has a bilinear scaled form. 

Much of the complexity of weak multiprojectivity comes from the handling of 
undefined values. In Section 24, we adopted a particular approach towards unde- 
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finedness in projective and rational functions, which might be called the almost- 
everywhere approach. In this approach, a projective function is considered to be 
defined only almost everywhere, in the sense of integration theory. The values of 
such a function can be changed arbitrarily on any set of measure zero without af- 
fecting the identity of the function. As a result, the formula that relates a projective 
function F[*\ to one of its scaled forms F* is required to hold only almost everywhere; 
that is, we require F[*]([e]) = [F*(e)] only for almost all e. Unfortunately, if we follow 
the almost-everywhere approach, it turns out that the concept of weak biprojectivity 
is much weaker than one might think, as demonstrated by the monstrosity g\*\ in 
the upcoming Example 26.1. 

We shall use x, y, and z to denote points in the line £[*], each of which is 
given in homogeneous coordinates in the form x — [x\\ Xo], where Xq and x\ are real 
numbers, not both zero. From the affine- world point of view, we have x = xi/xo- 
In the almost-everywhere approach, a pairing /i*](x,y) is weakly biprojective if and 
only if, for each fixed y, there exist four constants a, 6, c, and d, not all zero, such 
that the equation /i*](x,y) — [ax\ + 6xo;cxi + dxo] holds for almost all x, and 
similarly with x and y interchanged. A pairing /[*j(x,y) is strongly biprojective if 
and only if there exist eight absolute constants a, 6, c, rf, p, qr, r, and s, not all zero, 
such that the equation 

/(s,y) = [axiy x + 6xiy 0 + cx 0 yi + dx 0 yo; pxiyi + qziyo + rx 0 yi + sx 0 yo] 
holds for almost all pairs (x,y). 

Example 26.1. To construct this example, we assume the truth of the Contin- 
uum Hypothesis and of the Axiom of Choice; in particular, we assume that the real 
numbers can be well-ordered by the elements of fi, where Q denotes the smallest 
uncountable ordinal. If the real numbers can be so well-ordered, certainly the pro- 
jective line over the reals can be also. Pick such a well-ordering, and consider 
the set S of all pairs (x, y) in x with the property that the index of x is 
less than the index of y. For each fixed y, the set of x values with (x, y) in S is 
countable, and hence a set of measure zero in £[*). On the other hand, for each 
fixed x, the set of y values with (x,y) in 5 has a countable complement. Define a 
pairing gw by the rules 



For each fixed x, we have y[*i(x, y) = [1; 1] for almost all y; for each fixed y, we have 
01*1 ( x >!/) = [2;1] for almost all x. Hence, in the almost-everywhere approach, g[*\ 
is weakly biprojective — in fact, weakly biconstant. But g\*\ certainly isn't strongly 
biprojective; it isn't even measurable, as a bivariate function. 

The pairing yi*j of Example 26.1 shows that, in the almost-everywhere ap- 
proach, the notion of weak biprojectivity is too weak to be very interesting. We 
could try to strengthen the notion by adding a second clause to the definition, 
which would somehow constrain the behavior of the pairing /[*i(x, y) as a function 
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of two variables. But, once we are forced to constrain /(*](x,y) as a bivariate func- 
tion, rather than just constraining its sections, strong biprojectivity becomes the 
natural notion. 

There is an entirely different approach to the problem of undefinedness in pro- 
jective and rational functions, which we could have adopted in Section 24 instead of 
the almost-everywhere approach; we shall call it the indeterminate-point approach. 
In this approach, we invent a special point, called the indeterminate pointy and we 
add it by fiat to every projective space. The formerly undefined symbol [0] is then 
reinterpreted as naming the indeterminate point. With projective spaces augmented 
in this way, we can demand that the relation F[*j([e]) = [i**( e )l between a projective 
map and its linear scaled form must hold for all e, without exception. In particular, 
the case e = 0 demands that F\*\ ([0]) = [0]; that is, a projective map : P[*\ — ► <?[*] 
must take the indeterminate point of its domain F\*\ to the indeterminate point of 
its codomain Qm* For c ^ 0, the relation = [**(«)] demands that F[*\ carry 

a determinate point [e] to [0] precisely when the scaled form F* collapses the line 
[e] through the origin of P* into the origin [0] of Q*. In the indeterminate-point 
approach, we similarly demand that the identity relating a strongly multiprojective 
map to its multilinear scaled form must hold without exception. 

While the almost-everywhere approach tries to ignore undefined values as much 
as possible, the indeterminate-point approach builds undefinedness, and the precise 
locations of the undefined values, into the concept of a projective map. For example, 
a function L[*i — ► that is equal to a constant F[*](x) := w = [u>i; wo] almost 
everywhere is as constant as a function can be in the almost-everywhere approach, 
and is also projective in that approach. In the indeterminate-point approach, in 
order for F\*\ to be projective, it must satisfy the identity F[*\(x) = f° r 
some linear scaled form F* and for all x. If we want F\*\ to be as constant as 
possible, our best choice is to have F[*\(x) := [u>i(axi + bx 0 );wo{axi + bx 0 )] for 
some real constants a and 6, not both zero. This gives us F\*\(x) = w for all x 
except for the two special cases x = [-6; a] and x — [0;0] = [0], where we have 
F|*i([-6;a]) = F[*]([0]) = [0]. Thus, in order to be considered projective in the 
indeterminate-point approach, an almost-everywhere-constant map Fi*i must carry 
precisely one determinate argument point to the indeterminate result point; further- 
more, the identity of that special argument point is a distinguishing characteristic 
of the function 

Remark 26*2* The restrictions placed on almost-everywhere-constant functions 
F\*\: — ► if they want to be projective in the indeterminate-point approach 
have a certain charm. The facts that there is a unique x with x ^ [0] but F[*](x) = 
[0] and that this x must be specified mean essentially that the polynomial scale 
factor that converts the constant function F\*) from degree zero to degree one must 
be specified, up to a constant multiple. If converting to the indeterminate-point 
approach meant that all polynomial scale factors were automatically specified up to 
a constant multiple, we could eliminate the thick dividing line around the "rational 
map" box in Fig. 24.3. Unfortunately, that hope is not realized, because of the 
problem of multiple roots. Suppose that F[*\(x) := [0] for x in {[0],u,v}, while 
F\*\(x) := w t£ [0] for all other x; and suppose that we want to consider as a 
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degenerate case of a rational cubic curve. The cubic polynomial scale factor must 
have either u as a single root and v as a double root, or vice versa; but there is no 
way to tell which. Thus, we would still need the thick dividing line in Fig. 24.3, 
even if we converted completely to the indeterminate-point approach. (This example 
would work even over the complex numbers. Over the reals, things are worse: the 
map defined by ^i*i([0]) := [0] and F { *)(x) := w ^ [0] for x ^ [0] can be viewed 
as a degenerate quadratic rational curve by using, as scale factor, any quadratic 
polynomial that has no real roots.) 

The good news is that, in the indeterminate-point approach, the monstrous 
pairing g\*\ of Example 26.1 is neither weakly nor strongly biprojective. The bad 
news is that the indeterminate-point approach has its own monstrosities, like the 
following example, invented by Jim Saxe. 

Example 26.3. Let ic: — *• be an arbitrary bijective map, preferably rather 
chaotic. Define the pairing h[*\ as follows. If either x or y is the indeterminate 
point [0], then h M (x y y) := [0]. Also, if y = *(x) f then h w (x y y) := [0], Otherwise, 
A[*](x,y) := [1;1]. The function is weakly biprojective in the indeterminate- 
point approach, since each section parallel to either the x or y axis is the constant 
[1; 1] except for a single indeterminate value. For example, if we fix x, we can write 
>M*l(z>y) = k(z)oyi -*r(x)iyo;?r(x)oyi -*(x)iy 0 ], where ic(x) = [tt(x)i; tt(x) 0 ]; the 
affine-world version of this formula might be more perspicuous: 

y - y(x) 
y - *(x) 

If y is fixed, we have the similar formula 

<■».(*.») =^^4- 

x-*r 1 (y) 

On the other hand, the function h\*\ is not strongly biprojective, since there is 
too much information in jt to be repesented by eight coefficients. (Note that, 
in the almost-everywhere approach, the pairing is both weakly and strongly 
biprojective — in fact, both weakly and strongly biconstant.) 

We have tried two different approaches to handling undefinedness, and, in both 
cases, the natural notion of weak biprojectivity turned out to be too weak to be 
interesting: monstrous pairings existed that were weakly, but not strongly, bipro- 
jective. If the only difference between weak and strong multiprojectivity is that 
the weak case includes various monstrosities, then it probably isn't worth bothering 
with the weak case. We could continue to test this hypothesis by exploring still 
more approaches to handling undefinedness. One interesting possibility would be to 
call a pairing weakly biprojective if almost all of its sections were projective in the 
indeterminate-point sense. But we won't; instead, we shall try to tackle the question 
from a different perspective. What happens, in the indeterminate-point approach, 
if we restrict our attention to the special class of pairings where no undefinedness 
arises at all? Let us call a pairing /[*) total if the value /i*i(x,y) is a determinate 
point whenever both x and y are determinate points. 
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Exercise 26.4. Prove that strongly-biprojective total pairings exist. In particular, 
prove that the pairing given affinely by the formula 

is such a pairing. The projective version of this formula is /[*](x,y) := [xiVo + 
zoyij^oyo — Eil/i]; show that the two homogeneous coordinates on the right-hand 
side can be simultaneously zero over the real numbers only if either Xo = %i = 0 or 
y 0 = y x = 0. Note that this result depends on the fact that the coefficient field is 
the real numbers. Over the complex numbers, no strongly-biprojective pairing can 
be total. 

Restricting our attention to total pairings simplifies things substantially. In 
particular, consider a section of a weakly-biprojective total pairing /[*]; say that 
we fix z and consider the section y *-> /i*i(x,y). By weak biprojectivity in the 
indeterminate approach, there must exist four constants a, 6, c, and <f, not all zero, 
with /i*](x, y) = [ayi + byo] cyi + dy 0 ] for all y. The equivalent formula in the affine 
world is 

Note that this section will be total if and only if it is a fractional linear map y that 
is, if and only if ad-bcj^ 0. The weakly-biprojective pairing /[*] as a whole will be 
total if and only if all of its sections are fractional linear maps. 

Exercise 26.5. Over the real numbers, prove that every weakly-biprojective total 
pairing is actually strongly-biprojective. Warning: the proof that I know is rather 
intricate, and exploits both the topological and algebraic properties of the real 
numbers. 

The result in Exercise 26.5 deals yet another blow to the concept of weak mul- 
tiprojectivity, by suggesting that there are no interesting pairings that are weakly, 
but not strongly, biprojective. While that may be true over the real numbers, Jim 
Saxe discovered that it fails in a fascinating way over another coefficient field, in 
particular, over the field Jfc = Z/3Z. Over this field, there are precisely 4- 144 = 576 
weakly-biprojective total pairings, of which only 3 • 144 = 432 are strongly biprojec- 
tive. We shall close this section (and this paper) by studying the case k = Z/3Z in 
a series of exercises. 

For the rest of this section, let A; denote the coefficient field Z/3Z, which consists 
of {0,1,-1} under arithmetic modulo 3. Before we consider biprojective pairings 
over Jfc, it is worth pointing out that the discrete nature and finite characteristic of 
k cause other theories in this paper to work out rather differently. One example is 
that the almost-every where approach doesn't make any sense when the coefficient 
field is finite and discrete. As a second example, note that computing an n-blossom 
involves dividing by various multinomial coefficients with n as their upper index. 
If n equals or exceeds the characteristic of the coefficient field k> then blossoming 
won't be possible in general. 
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Exercise 26.6. There are 27 set-theoretic functions F:k — ► A:, which are precisely 
the 27 polynomials of the form F(x) := ax 2 + 6x + c for a, 6, and c in k. Polynomials 
of degree 3 or more are not needed, because x 3 = x (mod 3). There are 3 8 = 6561 
triaffine functions f:k z k> of which 81 are symmetric. These 81 symmetric tri- 
affine functions, however, have only 9 different diagonals among them. For example, 
the diagonal of the function /(xi,X2,xs) :— xi + x% + X3 is identically zero, and 
hence agrees with the diagonal of the zero function /(xi,X2,X3) := 0. Deduce that 
some of the quadratic functions over k don't have multiaffine 3-blossoms. In fact, 
the function F{x) :— ax 2 + bx + c has a 3-blossom if and only if a — 0, in which case 
it has 9 different 3-blossoms. 

Exercise 26.7. We now turn to pairings over k. An affine line L over k contains 
three points: 0, 1, and —1. The linearization L* of L is a vector space of dimension 
two over k, which contains 9 points. The projective line L\*\ over k consists of four 
points, which we shall refer to sloppily as 0, 1, oo, and —1, where: 

0 = [0;1] = [0;-1] 

1 = [1;1] = [-1;-1] 
oo = [l;0] = [-l;0] 
-1 = [-1;1] = [1;-1] 

Prove that a set-theoretic map H: L\*\ — ► is a fractional linear mapping if and 
only if H is a permutation on the four letters {0, 1, oo, - 1}. Conclude that a function 
/[*]: x — ► is a weakly-biprojective total pairing if and only if the values 
of /t*] form a 4-by-4 Latin square. 

Exercise 26.8. The structure of a pairing /i*](x,y) is not changed in any essential 
way if we permute the names of the four values of x, or those of y, since every 
permutation is a fractional linear map. Show that such renamings can be used to 
partition the set of all weakly-biprojective total pairings into equivalence classes of 
size 144, where each equivalence class contains precisely one member that satisfies 
the normalization conditions /i*i(ty,0) = /i*i(0,tt;) = it; for w in L\*\. Enumerate 
the 4-by-4 Latin squares by showing that there are precisely four of them that satisfy 
the normalization conditions, the four shown in Fig. 26.9. 
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Fig. 26.9. The four normalized Latin squares of order 4. 

Exercise 26.10. By the rules of the indeterminate-point approach, a pairing 
is strongly biprojective if and only if there exist eight constants, a through d and p 
through s, not all zero, such that 

/l*l(z>y) = [<*xiyi + bxiy 0 + cx 0 yi + dx 0 yol P^iVi + q*iyo + "oyi + sx 0 y 0 ] 

for all x and y. Show that, if the normalization conditions also hold for /[*], then 
there must exist two constants, a and p, such that 

/l*](z>y) = [axiyi + x x y 0 + x 0 y X ] px x y x + x 0 yo] 

for all x and y. The values of such a pairing are given in Fig. 26.11 as functions of 
a and p. 

Exercise 26.12. If we assume that the pairing is total, as well as being strongly 
biprojective and normalized, an explicit check of cases reveals that only three pos- 
sibilities for the two parameters a and p remain: we can have p = 1 and a — ±1 or 
we can have p = — 1 and a = 0. These three cases correspond to three of the four 
weakly-biprojective total pairings given in Fig. 26.9. The fourth (bottom-right) pair- 
ing in Fig. 26.9 represents an equivalence class of 144 total pairings over k = Z/3Z 
that are weakly biprojective, but not strongly biprojective. 

Exercise 26.13. Find a simple property of a weakly-biprojective total pairing over 
k that can serve as a test of strong biprojectivity. Hint: Consider the cycle structures 
of the six permutations formed by picking two rows out of the Latin square. 
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Fig. 26.11. The normalized, strongly-biprojective pairings over Z/3Z 
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